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IMPORTANT NOTICE 


Texas Instruments (TI) reserves the right to make changes to its products or to discontinue any 
semiconductor product or service without notice, and advises its customers to obtain the latest 
version of relevant information to verify, before placing orders, that the information being relied 
on is current. 


Tl warrants performance of its semiconductor products and related software to the specifications 
applicable at the time of sale in accordance with Tl’s standard warranty. Testing and other quality 
control techniques are utilized to the extent Tl deems necessary to support this warranty. 
Specific testing of all parameters of each device is not necessarily performed, except those 
mandated by government requirements. 


Certain applications using semiconductor products may involve potential risks of death, 
personal injury, or severe property or environmental damage (“Critical Applications”). 


TI SEMICONDUCTOR PRODUCTS ARE NOT DESIGNED, INTENDED, AUTHORIZED, OR 
WARRANTED TO BE SUITABLE FOR USE IN LIFE-SUPPORT APPLICATIONS, DEVICES 
OR SYSTEMS OR OTHER CRITICAL APPLICATIONS. 


Inclusion of TI products in such applications is understood to be fully at the risk of the customer. 
Use of Tl products in such applications requires the written approval of an appropriate TI officer. 
Questions concerning potential risk applications should be directed to Tl through a local SC 
sales office. 


In order to minimize risks associated with the customer’s applications, adequate design and 
operating safeguards should be provided by the customer to minimize inherent or procedural 
hazards. 


Tl assumes no liability for applications assistance, customer product design, software 
performance, or infringement of patents or services described herein. Nor does Tl warrant or 
represent that any license, either express or implied, is granted under any patent right, copyright, 
mask work right, or other intellectual property right of Tl covering or relating to any combination, 
machine, or process in which such semiconductor products or services might be or are used. 


Copyright © 1995, Texas Instruments Incorporated 
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The purpose of this user’s guide is to provide the Tl customer with information 
on 'C5x digital signal processors. This manual can also be used as a reference 
guide for developing hardware or software applications. The following list 
summarizes the contents of the chapters and appendices in this user’s guide. 


How to Use This Manual 


Chapter 1 


Chapter 2 


Chapter 3 


Chapter 4 


Chapter 5 


Chapter 6 


Chapter 7 


This document contains the following chapters: 


Introduction 
Summarizes the TMS320 family of products. Gives a general description, lists 
the key features, and presents some typical applications of the ‘C5x devices. 


Pinouts and Signal Descriptions 
Lists pin locations with associated signals, categorizes signals according to 
function, and describes signals. 


Architecture 
Gives a general architectural overview with a functional block diagram. 
Describes the 'C5x design, hardware components, and device operation. 


Assembly Language Instructions 

Lists instructions by function. Provides alphabetized individual instruction 
descriptions with examples. Includes ’C2x-to-’C5x instruction set mapping 
and instruction cycle times and opcodes. 


Peripherals 
Describes peripheral control, serial ports, software-programmable wait states, 
and timing circuits. 


Memory 

Discusses program/data memory operation and configuration (with memory 
maps), 1/O space, external interface considerations, DMA operation, and 
memory management. 


Software Applications 
Explains the use of ’C5x instruction set with particular emphasis on its new 
features. Includes code examples for various DSP applications. 
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Appendix A 


Appendix B 


Appendix C 


Appendix D 


Appendix E 


Appendix F 


Appendix G 


Appendix H 


Appendix | 


Electrical Specifications 
Provides design documentation for the ’C5x devices. This data is based upon 
design goals and modeling information. 


External Interface Timing 
Provides functional timing of operation on the external interface bus. 


Instruction Cycle Timings 
Details the instruction cycle timings organized in different classes. 


TMS320C5x System Migration 
Provides information for upgrading a ’C25 system to a’C5x system. Includes 
package dimensions and pinouts, timing similarities and differences, 
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development tool enhancements. 


XDS510 Design Considerations 
Provides information to meet the design requirements of the XDS510 emulator 
and to support XDS510 Cable #2563988—001 Rev. B. 


Analog Interface Peripherals and Applications 
Describes a variety of devices that interface directly to the TMS320 DSPs for 
various communication and multimedia applications. 


Memories, Sockets, and Crystals 

Provides product information regarding memories and sockets manufactured 
by Texas Instruments that are compatible with the ’C5x. Information is also 
given regarding crystal frequencies, specifications, and vendors. 


ROM Codes 
Outlines the procedural flow for submitting code and ordering TMS320 
mask-programmed ROM-based DSPs from Texas Instruments. 


Development Support 
Provides a description of the 'C5x development support tools. 


Related Documentation 


The following books describe the TMS320 fixed-point devices and related 
support tools. To obtain a copy of any of these Tl documents, call the Texas 
Instruments Literature Response Center at (800) 477-8924. When ordering, 
please identify the book by its title and literature number. 


TMS320C2x/C5x Optimizing C Compiler User’s Guide (literature number 
SPRU024) describes the ’C2x/C5x C compiler. This C compiler accepts 
ANSI standard C source code and produces TMS320 assembly 
language source code for the ’C2x and ’C5x generations of devices. 
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TMS320C5x C Source Debugger User’s Guide (literature number 
SPRUO055) tells you how to invoke the ’C5x emulator, SWDS, EVM, and 
simulator versions of the C source debugger interface. A tutorial 
introduces basic debugger functionality and discusses various aspects 
of the debugger interface, including window management, command 
entry, code execution, data management, and breakpoints. 

TMS320 Fixed-Point DSP Assembly Language Tools User’s Guide 
(literature number SPRU018) describes the assembly language tools 
(assembler, linker, and other tools used to develop assembly language 
code), assembler directives, macros, common object file format, and 
symbolic debugging directives for the 'C1x, ‘C2x, and 'C5x generations 
of devices. 


TMS320C5x Evaluation Module Technical Reference (literature number 
SPRU087) describes the 'C5x EVM, its features, design details and 
external interfaces. 


A wide variety of related documentation is available on digital signal 
processing. These references fall into one of the following application 
categories: 


[_.) Digital contro! systems 
[} Digital signal processing 
[J Image processing 

(.} Speech processing 


Within those areas, the references appear in alphabetical order according to 
author. The documents contain beneficial information regarding designs, 
operations, and applications for general and/or specific signal-processing 
systems as well as circuits; all of the documents provide additional references. 
Therefore, Texas Instruments strongly suggests that you refer to these 
publications. 


Digital Control Systems: 


1) Jacquot, R., Modern Digital Control Systems, New York, NY: Marcel 
Dekker, Inc., 1981. 


2) Katz, P., Digital Control Using Microprocessors, Englewood Cliffs, Nu: 
Prentice-Hall, Inc., 1981. 


3) Kuo, B.C., Digital Control Systems, New York, NY: Holt, Reinholt, and 
Winston, Inc., 1980. 


4) Moroney, P., /ssues in the Implementation of Digital Feedback 
Compensators, Cambridge, MA: The MIT Press, 1983. 


5) Phillips, C., and H. Nagle, Digital Control System Analysis and Design, 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1984. 
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Digital Signal Processing: 


1) Antoniou, A., Digital Filters: Analysis and Design, New York, NY: 
McGraw-Hill Company, Inc., 1979. 


2) Brigham, E.O., The Fast Fourier Transform, Englewood Cliffs, NJ: 
Prentice-Hall, Inc., 1974. 


3) Burrus, C.S., and T.W. Parks, DFT/FFT and Convolution Algorithms, New 
York, NY: John Wiley and Sons, Inc., 1984. 


4) Gold, Bernard, and C.M. Rader, Digital Processing of Signals, New York, 
NY: McGraw-Hill Company, Inc., 1969. 


5) Hamming, R.W., Digital Filters, Englewood Cliffs, NJ: Prentice-Hall, Inc., 
1977. 


6) IEEE ASSP DSP Committee (Editor), Programs for Digital Signal 
Processing, New York, NY: IEEE Press, 1979. 


7) Jackson, Leland B., Digital Filters and Signal Processing, Hingham, MA: 
Kluwer Academic Publishers, 1986. 


8) Jones, D.L., and T.W. Parks, A Digital Signal Processing Laboratory Using 
the TMS32010, Englewood Cliffs, NJ: Prentice-Hall, Inc., 1987. 


9) Lim, Jae, and Alan V. Oppenheim, Advanced Topics in Signal Processing, 
Englewood Cliffs, NJ: Prentice- Hall, Inc., 1988. 


10) Morris, Robert L., Digital Signal Processing Software, Ottawa, Canada: 
Carleton University, 1983. 


11) Oppenheim, Alan V. (Editor), Applications of Digital Signal Processing, 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1978. 


12) Oppenheim, Alan V., and R.W. Schafer, Digital Signal Processing, 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1975. 


13) Oppenheim, A.V., A.N. Willsky, and 1.T. Young, Signals and Systems, 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1983. 


14) Parks, T.W., and C.S. Burrus, Digital Filter Design, New York, NY: John 
Wiley and Sons, Inc., 1987. 


15) Rabiner, Lawrence R., and Bernard Gold, Theory and Application of 
Digital Signal Processing, Englewood Cliffs, NJ: Prentice-Hall, Inc., 1975. 


16) Texas Instruments, Digital Signal Processing Applications with the 
TMS320 Family, 1986; Englewood Cliffs, NJ: Prentice-Hall, Inc., 1987. 
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17) Treichler, J.R., C.R. Johnson, Jr., and M.G. Larimore, A Practical Guide 


to Adaptive Filter Design, New York, NY: John Wiley and Sons, Inc., 1987. 


Image Processing: 


1) 


2) 


3) 


Andrews, H.C., and B.R. Hunt, Digital Image Restoration, Englewood 
Cliffs, NJ: Prentice-Hall, Inc., 1977. 


Gonzales, Rafael C., and Paul Wintz, Digital Image Processing, Reading, 
MA: Addison-Wesley Publishing Company, Inc., 1977. 


Pratt, Willaim K., Digital Image Processing, New York, NY: John Wiley and 
Sons, 1978. 


Speech Processing: 


1) 


2) 


3) 


4) 


Gray, A.H., and J.D. Markel, Linear Prediction of Speech, New York, NY: 
Springer-Verlag, 1976. 


Jayant, N.S., and Peter Noll, Digital Coding of Waveforms, Englewood 
Cliffs, NJ: Prentice-Hall, Inc., 1984. 


Papamichalis, Panos, Practical Approaches to Speech Coding, Engle- 
wood Cliffs, NJ: Prentice-Hall, Inc., 1987. 


Rabiner, L.R., and R.W. Schafer, Digital Processing of Speech Signals, 
Englewood Cliffs, NJ: Prentice-Hall, Inc., 1978. 
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Style and Symbol Conventions 
This document uses the following conventions. 


[) Program listings and program examples are shown in a special 
typeface similar to a typewriter’s. 
Here is a segment of a program listing: 
OUTPUT: 


LDP #0 sdata page 0 

RPT #63 ;Output 64 values from a table at 800h 
LMMR 50h, 800h sin data memory to port 50h. 

RET 


[-} In syntax descriptions, the instruction is in bold typeface font and 
parameters are in italic typeface. Portions of a syntax in bold should be 
entered as shown; portions of a syntax in italics describe the type of 
information that you specify. Here is an example of an instruction syntax: 


[label BLDD src, dst 


BLDD is the instruction, which has two parameters indicated by src and 
dst. When you use BLDD, the first parameter must be an actual data 
memory source address and dst a destination address. A comma and a 
space must separate the two addresses. 


(.} Square brackets ( [ and ] ) identify an optional parameter. If you use an 
optional parameter, you specify the information within the brackets; you 
do not type the brackets themselves. In the example above, instead of 
typing [/abel], you specify a name for the label. When you specify more 
than one optional parameter from a list, you separate them with a comma 
and a space. 


[] Braces ( {and} ) indicate alist. The symbol | (read as o/) separates items 
within the list. Here’s an example of a list: 


ind: { * | *+ | *— | *0+ | *0— | *BRO+ | *BRO—} 
that provides seven choices. 


Unless the list is enclosed in square brackets, you must choose one item 
from the list. 


Information About Notes and Cautions 
This book may contain notes and cautions. 


() A note describes a preferred way or recommended procedure. 


aa a a a | 
Note: 


This is what a note looks like. 
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Trademarks 


[Jj A caution describes a situation that could potentially damage your 
software or equipment. 


The information in a note or caution is provided for your protection. Please read 
it carefully. 


MS-DOS and MS-Windows are trademarks of Microsoft Corp. 
DEC, VAX, and VMS are trademarks of Digital Equipment Corp. 
HP is a trademark of Hewlett Packard Co. 

Macintosh and MPW are trademarks of Apple Computer Corp. 
PC-DOS is a trademark of IBM Corp. 

Sun 3 and Sun 4 are trademarks of Sun Microsystems, Inc. 
UNIX is a trademark of UNIX System Laboratories, Inc. 


lf You Need Assistance. . . 


“Do this... 
Request more information about Call the LRC (Literature Response Center): 
Texas Instruments Digital Signal (800) 477-8924, 8:00—17:00 CST 


Processing (DSP) products or 
oder T! documentation 


‘Ifyou wantto... 


Or write to: 
Texas Instruments Incorporated 
Market Communications Manager, MS 736 


P.O. Box 1443 

Houston, Texas 77251-1443 
Ask questions about product Call the DSP hotline: 
operation or report suspected (713) 274-2320 


problems 


Report mistakes inthisdocument _ Fill out and return the reader response card at 


or any other TI documentation the end of this book, or send your comments to: 
Texas Instruments Incorporated 
Technical Publications Manager, MS 702 
P.O. Box 1443 
Houston, Texas 77251-1443 
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This user’s guide discusses the TMS320C5x digital signal processors (DSPs), 
the newest fixed-point generation in the TMS320 family. The ’C50, the ’C51, 
and the ’C53 are the first devices in this generation. Their central processing 
unit (CPU) core is based upon the ’C25’s CPU core with additional architectur- 
al enhancements to greatly improve overall performance. The 'C5x generation 
devices are capable of executing at twice the speed of the C2x and are 
source-code upward compatible with all ’C1x and ’C2x devices. Expansion of 
this fixed-point generation is expected in the near future to provide even higher 
levels of DSP performance. 


The ’C5x generation consists of the following devices: 


Jj ’C50 is a static CMOS digital signal processor with 10K words of on-chip 
RAM and 2K words of on-chip ROM. 

Jj ‘C51 is a static CMOS digital signal processor with 2K words of on-chip 
RAM and 8K words of on-chip ROM. 

[.} ’C53 is a static CMOS digital signal processor with 4K words of on-chip 
RAM and 16K words of on-chip ROM. 


This chapter discusses these topics: 


Topic Page 


1-1 


Introduction 


Figure 1-1. Evolution of the TMS320 Family 
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_TMS320 Family Overview 


1.1 TMS320 Family Overview 


The TMS320 family consists of 16-bit fixed-point. and 32-bit floating-point 
single-chip digital signal processing devices. These processors possess the 
operational flexibility of high-speed controllers and the numerical capability of 
array processors. Combining those two qualities, the TMS320 processors are 
inexpensive alternatives to custom-fabricated VLSI and multichip bit-slice pro- 
cessors. The following qualities make this family the ideal choice for a wide 
range of processing applications (refer to Table 1-1 for alis' applications): 


Lj Very flexible instruction set 

(_.] Inherent operational flexibility 

(_] High-speed performance 

(J Innovative, parallel architectural design 
[_] Cost effectiveness 


In 1982, Texas Instruments introduced the first fixed-point digital signal pro- 
cessor in the TMS320 family, the TMS32010. Before the year had ended, the 
Electronic Products magazine awarded the TMS32010 the title Product of 
the Year. The TMS32010 became the model for future TMS320 generations. 


Today, the TMS320 family consists of five generations: ’C1x, ’'C2x, ’C3x, ’C4x, 
and ’C5x. Figure 1—1 illustrates the performance gains that the TMS320 family 
has made over time with successive generations. Note that the 'C1x, ’C2x, and 
‘C5x generations are fixed-point, and the ’C3x and ’C4x generations are float- 
ing-point. Source code is upward compatible from one fixed-point generation 
to the next fixed-point generation and, likewise, from one floating-point gener- 
ation to the next floating-point generation. Compatibility preserves the soft- 
ware portion of your investment, thereby providing a convenient and cost-effi- 
cient roadmap to a higher performance, more versatile DSP system. 


Each generation of TMS320 devices has an internal core CPU and a variety 
of memory and peripheral configurations. New combinations of on-chip 
memory and peripheral options can create spin-off devices. These spin-offs 
can satisfy a wide range of needs in the worldwide electronics market. When 
memory and peripherals are integrated into one processor, overall system 
cost is greatly reduced and board space is saved. 


1.1.1. Typical Applications 


With its unique versatility and real-time performance, a 'C5x-generation pro- 
cessor offers better, more adaptable approaches to traditional signal-proces- 
sing problems such as vocoding and filtering. Furthermore, the ’C5x supports 
complex applications that often require several operations to be performed si- 
multaneously. Table 1—1 lists those applications for which a ’C5x device is well 
suited. 
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Table 1—1. Typical Applications for the TMS320 Family 


Adaptive Ride Control Digital Radio/TV Disk Drive Control 
Antiskid Brake Educational Toys Engine Control 
Cellular Telephone Music Synthesizer Laser Printer Control 
Digital Radio Power Tools Motor Control 
Engine Control Radar Detector Robotics Control 
Global Positioning Solid-State Answering Servo Control 


Navigation Machines 
Vibration Analysis 
Voice Commands 


General-Purpose Graphics/Imaging Industrial 


Adaptive Filtering 3-D Rotation Numeric Control 

Convolution Animation/Digital Map Power-Line Monitoring 

Correlation Homomorphic Processing Robotics 

Digital Filtering Pattern Recognition Security Access 

Fast Fourier Transforms Image Enhancement 

Hilbert Transforms Image Compression/ 

Waveform Generation Transmission 

Windowing Robot Vision 

Workstations 

Digital Filtering Diagnostic Equipment Image Processing 

Function Generation Fetal Monitoring Missile Guidance 

Pattern Matching Hearing Aids Navigation 

Phase-Locked Loops Patient Monitoring Radar Processing 

Seismic Processing Prosthetics Radio Frequency 

Spectrum Analysis Ultrasound Equipment Modems 

Transient Analysis Secure Communications 
; Sonar Processing 

1200- to 19200-bps DTMF Encoding/Decoding Speech Enhancement 

Modems Echo Cancellation Speech Recognition 


Adaptive Equalizer FAX Speech Synthesis 
ADPCM Transcoder Line Repeater Speaker Verification 
Cellular Telephone Speaker Phone Speech Vocoding 
Channel Multiplexing Spread Spectrum Voice Mail 

Data Encryption Communications Text-to-Speech 
Digital PBXs Video Conferencing 


Digital Speech Interpolation X.25 Packet Switching 
(DSI) 
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1.2 General Description 


The ’C5x generation consists of the ’'C50, the ’C51, and the ’C53 devices. 
These digital signal processors (DSPs) are fabricated in accordance with stat- 
ic CMOS integrated-circuit technology. Their architectural design is based 
upon that of the ‘C25. The combination of an advanced Harvard architecture 
(separate buses for program memory and data memory), additional on-chip 
peripherals, more on-chip memory, and a highly specialized instruction set is 
the basis of the operational flexibility and speed of these DSP devices. The 
’C5x devices are designed to execute more than 28 MIPS (million instructions 
per second). Future spin-off devices with the core CPU and customized 
on-chip memory and peripheral configurations may be developed for special- 
ized areas of the electronics market. 


The ’C5x generation offers these advantages: 


Enhanced TMS320 architectural design for increased performance and 
versatility 

Modular architectural design for fast development of spin-off devices 
Advanced IC processing technology for increased performance 
Downward source-code compatibility with °C 1x and 'C2x DSPs for fast 
and easy performance upgrades 

Enhanced TMS320 instruction set for faster algorithms and for optimized 
high-level language operation 

New static design techniques for minimizing power consumption and max- 
imizing radiation hardness 


ODO O OOO UO 


Table 1-2 provides an overview of the 'C5x generation of digital signal proces- 
sors. It shows the capacity of on-chip RAM and ROM memories, number of 
serial and parallel 1/O ports, execution time of one machine cycle, and type of 
package with total pin count. The chart should help you choose the best pro- 
cessor for an application. 


The following subsections summarize key features of the ’C5x processors. 
The CPU description applies to all ’C5x-generation members (current and fu- 
ture). Descriptions of the remaining features apply only to the ’C50, ’C51 and 
the ’C53. Detailed information on the CPU, on-chip peripherals, and memory, 
is given in Chapters 3, 5, and 6, respectively. 
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Table 1—2. Characteristics of the ’C5x DSP Processors 


Package 


TMS320 - en ee 1/0 Ports 
Device ee 
seat Mice GFP 
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t Note that 16 of the 64K parallel I/O ports are memory-mapped. 
§ QFP = Quad Flat Pack. 
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1.3 Key Features 


Key features of the ’C5x DSPs are listed below. Where a feature is exclusive 
to a particular device, the device’s name is enclosed within parentheses and 
noted after that feature. 


OO UO OUOO OO oO OOOO OC OOOOOOOOOOCL CO 


35-/50-ns_ single-cycle fixed-point instruction execution time (28.6/20 
MIPS) 

Upward source-code compatible with all ’C1x and ’'C2x devices 
RAM-based memory operation ('C50) 

ROM-based memory operation ('C51) 

9K x 16-bit single-cycle on-chip program/data RAM (’C50) 

1K x 16-bit single-cycle on-chip program/data RAM (’C51) 

3K x 16-bit single-cycle on-chip program/data RAM (’C53) 

2K x 16-bit single-cycle on-chip boot ROM (’C50) 

8K x 16-bit single-cycle on-chip program ROM (’C51) 

16K x 16-bit single-cycle on-chip program ROM (’C53) 

1056 x 16-bit dual-access on-chip data RAM 

224K x 16-bit maximum addressable external memory space (64K pro- 
gram, 64K data, 64K I/O, and 32K global) 

32-bit arithmetic logic unit (ALU), 32-bit accumulator (ACC), and 32-bit ac- 
cumulator buffer (ACCB) 

16-bit parallel logic unit (PLU) 

16 x 16-bit parallel multiplier with a 32-bit product capability 
Single-cycle multiply/accumulate instructions 

Eight auxiliary registers with a dedicated arithmetic unit for indirect ad- 
dressing 

Eleven context-switch registers (shadow registers) for storing strategic 
CPU-controlled registers during an interrupt service routine 

Eight-level hardware stack 

O- to 16-bit left and right data barrel-shifters and a 64-bit incremental data 
shifter 

Two indirectly addressed circular buffers for circular addressing 
Single-instruction repeat and block repeat operations for program code 
Block memory move instructions for better program/data management 
Full-duplex synchronous serial port for direct communication between the 
‘C5x and another serial device 

Time-division multiple-access (TDM) serial port 

Interval timer with period, control, and counter registers for software stop, 
start, and reset 

64K parallel I/O ports, 16 of which are memory mapped 

Sixteen software-programmable wait-state generators for program, data, 
and I/O memory spaces 
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Extended hold operation for concurrent external DMA 
Four-deep pipelined operation for delayed branch, call, and return instruc- 
tions 

Index-addressing mode 

Bit-reversed index-addressing mode for radix-2 FFTs 
Divide-by-one clock option 

On-chip clock generator 

JTAG boundary scan logic (IEEE standard, 1149.1) 
On-chip scan-based emulation logic 

5-V static CMOS technology with two power-down modes 
132-pin quad flat pack package 


DOUOUUOOUO OO 


Enhancements to the ’C5x CPU maintain source code compatibility with the 
‘C1x and ’C2x generations while improving performance and versatility. Im- 
provements include a 32-bit accumulator buffer, additional scaling capabili- 
ties, and a host of new instructions to exploit the additional hardware while sup- 
plying a more orthogonal instruction set to the user. The new control functions 
include an independent parallel logic unit (PLU) for performing Boolean opera- 
tions and a set of context-switch registers for providing zero-latency con- 
text-switching capabilities to interrupt service routines (ISRs). Data manage- 
ment has been improved through the use of new block move instructions and 
memory-mapped register instructions. The ‘C5x has 28 memory-mapped 
core-CPU registers and 16 memory-mapped I/O ports. See Chapter 3 for more 
details. 


1.3.2 On-Chip ROM 


The ’C50 features a2K x 16-bit on-chip, maskable, programmable ROM. This 
memory is used for booting from slower external ROM or EPROM of program 
to fast on-chip or external SRAM. ROM can be selected during reset by driving 
the MP/MT pin low. Once your program has been booted into the RAM, this 
boot ROM can be operationally removed from the program memory space via 
the MP/MC bit in the PMST status register. If the ROM is not selected, the 'C50 
starts its execution via an off-chip memory. 


The ’C51 features an 8K x 16-bit on-chip maskable ROM. The ’C53 features 
a 16K x 16-bit on-chip maskable ROM. You can use this memory for your spe- 
cified program. Once the development of the program has stabilized, submit 
a ROM code to Texas Instruments for implementation into your device. See 
Chapter 6 for more details. 
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1.3.3 On-Chip Data RAM 


1.3.4 


1.3.5 


1.3.6 


1.3.7 


On-Chip Prog 


On-Chip Mem 


All’C5x devices carry a 1056 x 16-bit on-chip data RAM. This RAM can be ac- 
cessed twice per machine cycle (dual-access RAM). This block of memory is 
primarily intended to store data values but, when needed, can be used to store 
programs as well as data. It can be configured in one of two ways: either all 
1056 x 16 bits as data memory or 544 x 16 bits as data memory with 512 x 16 
bits as program memory. You can select the configuration with the CNF bit in 
status register ST1. See Chapter 6 for more details. 


ram/Data RAM 


The ’C50 has a 9K x 16-bit on-chip RAM. The ’C51 has a 1K x 16-bit on-chip 
RAM. This memory is software configurable as program and/or data memory 
space. Code can be booted from an off-chip nonvolatile memory and then ex- 
ecuted at full speed, once it is loaded into this RAM. See Chapter 6 for more 
details. 


ory Security 


The ’C5x generation has a maskable option to protect the contents of on-chip 
memories. When the related bit is set, no externally originating instruction can 
access the on-chip memory spaces. See Chapter 6 for more details. 


Address-Mapped Software Wait-State Generators 


Software wait-state logic is incorporated without any external hardware into 
'C5x for interfacing with slower off-chip memory and I/O devices. This circuitry 
consists of 16 wait-state generating circuits and is user programmable to oper- 
ate O, 1, 2, 3, or 7 wait states. For off-chip memory accesses, these wait-state 
generators can be mapped on 16K-word boundaries in program memory, data 
memory, and to the I/O ports. See Chapter 5 for more details. 


Parallel 1/O Ports 


Each ’C5x device has a total of 64K I/O ports, sixteen of which are 
memory-mapped in data memory space. These ports can be addressed by the 
IN instruction or the OUT instruction. The memory-mapped I/O ports can be 
accessed with any instruction that reads or writes data memory. An active-low 
IS signal indicates a read/write operation via an I/O port. Requiring minimal 
off-chip address-decoding circuits, the ’C5x can easily interface with external. 
|/O devices via the I/O ports. See Chapter 5 for more details. 


1.3.8 Serial 1/O Ports 


The ’C5x devices carry two high-speed serial ports. These serial ports are ca- 
pable of operating at up to one-fourth the machine cycle rate (CLKOUT1). One 
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Key Features 


of the two circuits is a synchronous, full-duplex serial port. Its transmitter and 
receiver are double buffered and individually controlled by maskable external 
interrupt signals. Data is framed either as bytes or as words. The second circuit 
is a full-duplex serial port that can be configured either for synchronous or for 
time-division multiple-access (TDM) operations. The TDM serial port is com- 
monly used in multiprocessor applications. See Chapter 5 for more details. 


1.3.9 Hardware Timer 


The 'C5x features a 16-bit timing circuit with a 4-bit prescaler. This timer clocks 
between one-half and one-thirty-second the machine rate of the device itself, 
depending upon the programmable timer’s divide-down ratio. This timer can 
be stopped, restarted, reset, or disabled by specific status bits. See Chapter 
5 for more details. 


1.3.10 User-Maskable Interrupts 


The ’C5x devices have four external-interrupt lines. These lines are internally 
latched so that asynchronous interrupt operations can be performed by the 
TMS320 device. Also, each device possesses five internal interrupts: the timer 
interrupt and four serial port interrupts. See Chapter 5 for more details. 


1.3.11 JTAG Scanning Logic 


1.3.12 Packages 


The JTAG scanning logic circuitry is used for emulating and testing purposes 
only. The JTAG scan logic provides the boundary scan to and from the interfac- 
ing devices. Also, it can be used to test pin-to-pin continuity as well as to per- 
form operational tests on those peripheral devices that surround the 'CS5x. It 
is interfaced to another internal scanning logic circuitry, which has access to 
all of the on-chip resources. Thus, the 'C5x can perform on-board emulation 
by means of the JTAG serial scan pins and the emulation-dedicated pins. See 
IEEE Standard 1149.1 for more details. 


The ’C5x devices are packaged in a 132-pin quad flat pack package (QFP). 
With consideration for the pin layout of a ’C25 package, the 'C5x package is 
designed to minimize printed circuit board modifications when a ’'C2x-based 
system is upgraded to a 'C5x processing system. Signal callouts for the ’'C5x 
appear on the same side and in the same order as those for the 'C25. See 
Chapter 2 for details. 
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Pinouts a and Si oh LE nt os 


The 'C5x DSPs are available in a 132-pin quad flat pack (QFP) package and 
have identical pin-to-signal relationship. The QFP package conforms to 
JEDEC specifications for electrical/electronic components. Electrical specifi- 
cations and mechanical data for the ’C5x DSPs are in Appendix A. 


This chapter presents a simple layout of a 132-pin QFP package, with pin and 
signal callouts, and a table of signal definitions, in the following sections: 


Topic Page 


Pinout 


2.1 Pinout 


The ’C5x devices are packaged in a 132-pin quad flat pack package (QFP) and 
have the same pin-to-signal relationship. Figure 2-1 shows the pin/signal call- 
outs for this package. 


Figure 2—1. Signal Assignments for ’C5x 132-Pin QFP 
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Note: NC = Noconnect. (These pins are reserved.) 
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2.2 Signal Descriptions 


The signals for the ’C5x device are described in this section. Table 2—1 lists 
each signal, its pin location, function, and operating mode(s), i.e., input (I), out- 
put (O), high-impedance (Z) or supply (S) state. The signals are grouped ac- 
cording to their functional purpose. 


Table 2—1. TMS320C5x Signal Descriptions 


Signal | Pin | State | Description 


A15 
A14 
A13 
A12 
Alt 
A10 
AQ 
A8 
A7 
A6 
AS 
A4 
A3 
A2 
Al 
AO 


Address and Data Buses 


(MSB) Parallel address bus A15 (MSB) through AO (LSB). Multi- 
plexed to address external data/program memory or I/O. 
Placed in high-impedance state in hold mode. These signals 
also go into high impedance when OFF is active low. These 
signals are used as inputs for external DMA access of the 
on-chip single-access RAM. They become inputs while 
HOLDA is active low if the BR pin is externally driven low. 


Parallel data bus D15 (MSB) through DO (LSB). Multiplexed 
to transfer data between the core CPU and external data/ 
program memory or I/O devices. Placed in high-impedance 
state when not outputting or when RS or HOLD is asserted. 
They also go into high impedance when OFF is active low. 
These signals are also used in external DMA access of the 
on-chip single-access RAM. 


Allinput pins that are unused should be connected to Vpp or an external pull-up resistor. The BR pin has 
an internal pull-up for performing DMA to the on-chip RAM. For emulation, TRST has an internal pull- 
down, and TMS, TCK, and TDI have internal pull-ups. EMUO and EMU1 require external pull-ups to 
support emulation. 


Signal Descriptions 


Table 2-1. 


TMS320C5x Signal isi hae (Continued) 


_ Signal | Pin | State =| Description | 


Memory Control Signals 


Data, Program, and |/O space select signals. Always high 
unless low level asserted for communicating to a particular 
external space. Placed into a high-impedance state in hold 
mode. These signals also go into high-impedance when 
OFF is active low. 


Data ready input. Indicates that an external device is pre- 
pared for the bus transaction to be completed. If the device 
| is not ready (READY is low), the processor waits one cycle 
and checks READY again. READY also indicates a bus | 
grant to an external device after a BR (bus request) signal. | 


Read/Write signal. Indicates transfer direction during com- 
munication to an external device. Normally in read mode 
(high), unless low level asserted for performing a write oper- | 
ation. Placed in high-impedance state in hold mode. This sig- | 
| nal also goes into high impedance when OFF is active low, | 
and it is used in external DMA access of the 9K RAM cell. 
While HOLDA and TAQ are active low, this signal is used to | 
indicate the direction of the data bus for DMA reads (high) 
and writes (low). | 


Strobe signal. Always high unless asserted low to indicate | 
an external bus cycle. Placed in high-impedance state in the 
hold mode. This signal also goes into high impedance when 
OFF is active low, and it is used in external DMA access of | 
the on-chip single-access RAM. While HOLDA and IAQ are 
active low, this signal is used to select the memory access. 


Read select indicates an active, external read cycle and may | 
connect directly to the output enable (OE) of external de- | 
vices. This signal is active on all external program, data, and 
I/O reads. Placed into high-impedance state in hold mode. 
This signal also goes into high impedance when OFF is ac- 
tive low. 


| Write enable. The falling edge of this signal indicates thatthe | 
| device is driving the external data bus (D15—D0). Data may | 
be latched by an external device on the rising edge of W... | 
This signal is active on all external program, data, and I/O | 
writes. Placed into high-impedance state in hold mode. This 
signal also goes into high impedance when OFF is active 
low. 
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Table 2-1. TMS320C5x Signal Descriptions (Continued) 


Multiprocessing Signals 


Hold input. This signal is asserted to request control of the 
address, data, and control lines. When acknowledged by the 
*C5x, these lines go to the high-impedance state. 


Hold acknowledge signal. Indicates to the external circuitry 
that the processor is in a hold state and that the address, 
data, and memory control lines are in a high-impedance 
state so that they are available to the external circuitry for ac- 
cess of local memory. This signal also goes into high imped- 
ance when OFF is active low. 


Bus request signal. Asserted during access of external glob- 
al data memory space. READY is asserted to the device 
when the global data memory is available for the bus trans- 
action. BR can be used to extend the data memory address 
space by up to 32K words. It goes into high impedance when 
OFF is active low. BR is used in external DMA access of the 
on-chip single-access RAM. While HOLDA is active low, BR 
| is externally driven low to request access to the on-chip 
single-access RAM. 


Instruction acquisition signal. This signal is asserted (active 
low) when there is an instruction address on the address bus | 
and goes into high impedance when OFF is active low. [AQ 
is also used in external DMA access of the on-chip single-ac- 
cess RAM. While HOLDA is active low, [AQ acknowledges 
the BR request for access of the on-chip single-access RAM 
and stops indicating instruction acquisition. 


Branch control input. Samples as the BIO condition. If low, 
the device executes the conditional instruction. This signal 
| must be active during the fetch of the conditional instruction. 


| External flag output (latched software-programmable sig- 
nal). This signal is set high or low by specific instruction or 
by loading status register 1 (ST1). Used for signaling other 
processors in multiprocessor configurations or as a general- 
| purpose output pin. This signal also goes into high imped- 
ance when OFF is active low. This pin is set high at reset. 


Interrupt acknowledge signal. indicates receipt of an inter- 
rupt and that the program counter is fetching the interrupt 
vector location designated by A15—A0. This signal also goes 
into high impedance when OFF is active low. 
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Table 2-1. TMS320C5x Signal Descriptions (Continued) 


Signal | Pin | State = | Description a a | 


Initialization, Interrupt, and Reset Operations _ 


INT4 41 External user interrupt inputs. Prioritized and maskable by the interrupt mask 

INT3 a register and interrupt mode bit. Can be polled and reset via the interrupt flag 

INT2 register. 

INTT 38 : 
Nonmaskable interrupt. External interrupt that cannot be masked via the 
INTM orthe IMR. When NM1 is activated, the processor traps to the appropri- 
ate vector location. 

127 Reset input. Causes the device to terminate execution and forces the pro- | 


gram counter to zero. When R$ is brought to a high level, execution begins 
at location zero of program memory. RS affects various registers and status 
bits. 


Microprocessor/Microcomputer mode select pin. If active low at reset (micro- 
computer mode), the pin causes the internal program ROM to be mapped | 
into program memory space. In the microprocessor mode, all program 
memory is mapped externally. This pin is sampled only during reset, and the | 
mode that is set at reset can be overridden via the software control bit MP/ 


Oscillator/Timer Signals CLKIN1/2 _ 
Master clock output signal (or CLKIN2 frequency). This signal cycles at the 
machine-cycle rate of the CPU. The internal machine cycle is bounded by 
the rising edges of this signal. This signal also goes into high impedance 


CLKOUT1 110 | O/Z 
; when OFF is active low. 


CLKMD1 CLKMD2 Clock Mode | 
CLKMD1 71 External clock with divide-by-two option. Input 
CLKMD2 103 clock provided to X2/CLKIN1 pin. Internal oscilla- 
tor and PLL disabled. 
| 1 Reserved for test purposes. 
1 External divide-by-one option. Input clock pro- 
| vided to CLKIN2. Internal oscillator disabled. 
Internal PLL enabled. 
Internal or external divide-by-two option. Input 
clock provided to X2/CLKIN1 pin. Internal oscilla- 
| tor enabled. Internal PLL disabled. 


X2/CLKIN1 ? Input pin to internal oscillator from the crystal. If the internal oscillator is not 
being used, a clock may be input to the device on this pin. The internal ma- 
Le 


chine cycle is half this clock rate. 


Output pin from the internal oscillator for the crystal. If the internal oscillator 
is not used, this pin should be left unconnected. This signal does not go into 
high impedance when OFF is active low. 
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Table 2-1. TMS320C5x Signal Descriptions (Continued) 


Signal | Pin | State =| Description _ | 


Oscillator/Timer Signals (Concluded) 


CLKIN2 Divide-by-1 input clock for driving the internal machine 
rate. 
TOUT ie Timer output. This pin signals a pulse when the on-chip tim- 


er counts down past zero. The pulse is a CLKOUT1 cycle 
Supply Pins 


wide. 


Vopr Power supply for data bus. 
Vope2 Power supply for data bus. 


Vpop3 Power supply for data bus. 


Vpp4 Power supply for data bus. 


Vpps Power supply for address bus. 


Vppe Power supply for address bus. 


Vpp7 Power supply for inputs and internal logic. 


Power supply for inputs and internal logic. 


ak. 5 ~] 


Vpps 


| Vpp9 Power supply for address bus. 


Power supply for address bus. 


Vpp10 


| Vopi14 Power supply for memory control signals. 


Vpp12 Power supply for memory control signals. 


_ 
—_ 
eR) 


Power supply for inputs and internal logic. | 


Vop13 


Power supply for inputs and internal logic. 


ok 
_ 
> 


Vpp14 


Vpp15 Power supply for memory control signals. | 


Vopiée Power supply for memory control signals. 


Vssi1 Ground for memory control signals. 


Ground for memory control signals. 


Vsse2 
Vss3 Ground for data bus. 
Vssq4 Ground for data bus. 
Vss5 Ground for data bus. 
Vsse 


Vss7 


Ground for data bus. 


Ground for address bus. 


Vsse Ground for address bus. 


Vsso 
| Vgsi0 
Vssi1 


Ground for address bus. 


Ground for address bus. 


Ground for memory control signals. | 


Vssi2 Ground for memory control signals. 


awk, 


ie) 
N 


| Vssi3_ Ground for inputs and internal logic. 


oF 
N 
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Table 2-1. TMS320C5x Signal Descriptions (Continued) 


Supply Pins escheat 
Sa A 
[Vess_ [120 [S| Groundtorinputsandiniemaltogic 
[Yess | 1 [S| Groundiorinpsts andinimatiog. | 


Serial Port Signals 


Receive clock inputs. External clock signal for clocking 
data from the DR/TDR (data receive) pins into the RSR 
(serial port receive shift register). Must be present during | 
serial port transfers. If the serial port is not being used, 
these pins can be sampled as an input via the INO bit of the | 
SPC/TSPC registers. 3 


Transmit clock. Clock signal for clocking data from the DR/ 

TDR (data receive register) to the DX/TDX (data transmit | 
pin). The CLKX can be an input if the MCM bit in the serial | 
port control register is set to 0. It may also be driven by the | 
device at 1/4 the CLKOUT1 frequency when the MCM bit 
is set to 1. If the serial port is not being used, this pin can 
be sampled as an input via the IN1 bit of the SPC/TSPC | 
register. This signal goes into high impedance when OFF 
is active low. 


Serial data receive inputs. Serial data is received in the 
RSR (serial port receive shift register) via the DR/TDR pin. 


Serial port transmit outputs. Serial data transmitted from | 
the XSR (serial port transmit shift register) via the DX/TDX | 
pin. Placed in high-impedance state when not transmitting 
and also when OFF is active low. 


Frame synchronization pulse for receive input. The falling 

edge of the FSR/TFSR pulse initiates the data receive pro- 
cess, beginning the clocking of the RSR. TFSR becomes 
an input/output (TADD) pin when the serial port is operat- | 
ing in TDM mode (TDM bit = 1). In TDM mode, this pin is 
used to output/input the address of the port. This signal | 
goes into high impedance when OFF is active low. . 


Frame synchronization pulse for transmit input/output. The 

falling edge of the FSX/TFSX pulse initiates the data trans- | 
mit process, beginning the clocking of the XSR. Following | 
reset, the default operating condition of FSX/TFSxX is an in- 
put. This pin may be selected by software to be an output | 
when the TXM bit in the serial control register is set to 1. 
This signal goes into high impedance when OFF is active | 
low. When operating in TDM mode (TDM bit = 1), the TFSX 

pin becomes TFRM, the TDM frame synch. | 


FSX 104 (/O/Z 
TFSX/TFRM 105 /O/Z 


9-8 Pinout and Signal Descriptions 


Signal Descriptions 


Table 2-1. TMS320C5x Signal Descriptions (Continued) 


Test Signais 


TCK JTAG test clock. This is normally a free-running clock sig- 
nal with a 50% duty cycle. The changes on TAP (test ac- 
cess port) input signals (TMS and TDI) are clocked into the 
TAP controller, instruction register, or selected test data 
register on the rising edge of TCK. Changes at the TAP out- 

| put signal (TDO) occur on the falling edge of TCK. 


JTAG test data input. TDI is clocked into the selected regis- 
ter (instruction or data) on a rising edge of TCK. 


JTAG test data output. The contents of the selected regis- 
ter (instruction or data) is shifted out of TDO on the falling 
edge of TCK. TDO is in high-impedance state except when 
scanning of data is in progress. This signal also goes into 
high impedance when OFF is active low. 


JTAG test mode select. This serial control input is clocked 
into the test access port (TAP) controller on the rising edge 


JTAG test reset. This signal, when active high, gives the 
JTAG scan system control of the operations of the device. 
If this signal is not connected or driven low, the device will 
operate in its functional mode, and the JTAG signals are ig- 
nored. 


Emulator pin 0. When TRST is driven low , this pin must be 
high for activation of the OFF condition (see pin 119). 
When TRST is driven high, this pin is used as an interrupt | 
to or from the emulator system and is defined as input/out- 
put via JTAG scan. 


Signal Descriptions 


Table 2-1. TMS320C5x Signal Descriptions (Concluded) 
| Signal =| PI in| | State = | Description | — 


Test pF TestSignais (Concluded) (Concluded) 


~ EMU1/OFF nie /O/Z Emulator pin 1/disable all outputs. a 1, When TRST is driven 


high, this pin is used as an interrupt to or from the emulator 
RESERVED 


system and is defined as input/output via JTAG scan. 
When TRST is driven low, this pin is configured as OFF. 
The EMU1/ OFF signal, when active low, puts all output 
drivers into the high-impedance state. Note that OFF is 
used exclusively for testing and emulation purposes (not 
for multiprocessing applications). Thus, for OFF condition, 
the following conditions apply: 

TRST=low, 
EMUO=high 
EMU1/OFF=low 


Reserved pin. These pins are reserved for future 'C5x de- 
vices. These pins should be left unconnected. 
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The architectural structure of a TMS320 DSP consists of three basic seg- 
ments: 


[ Central processing unit (CPU) 
[_} Memory 
(_} Peripheral-interfacing circuits 


This chapter describes the architecture and operation of the ’C5x core CPU; 
the memory and peripheral segments are not discussed except in relation to 
the core CPU of the ’C5x generation. This CPU is capable of performing 
high-speed arithmetic executions within a short instruction cycle by means of 
its highly parallel architectural design. 


For information on the memory organization of the ’C5x, refer to Chapter 6, 
Memory. For further details about on-chip peripheral organization, refer to 
Chapter 5, Peripherals. The major topics in this chapter are: 
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3.1 Architectural Overview 


3-2 


The ’C5x high-performance digital signal processors are designed, like the 
‘C25, with an advanced Harvard-type architecture that maximizes the pro- 
cessing power by maintaining two separate memory bus structures, program 
and data, for full-speed execution. Instructions support data transfers between 
the two spaces. 


The ’C5x performs 2s-complement arithmetic, using the 32-bit arithmetic log- 
ic unit (ALU) and accumulator. The ALU is a general-purpose arithmetic unit 
that uses 16-bit words taken from data memory or derived from immediate in- 
structions, or the 32-bit result from the multiplier. In addition to arithmetic oper- 
ations, the ALU can perform Boolean operations. The accumulator stores the 
output from the ALU and is also the second input to the ALU. The accumulator 
is 32 bits long and is divided into a high-order word (bits 31 through 16) and 
a low-order word (bits 15 through 0). Instructions are provided for storing those 
high- and low-order accumulator words in memory. For fast, temporary stor- 
age of the accumulator, there is a 32-bit accumulator buffer. 


In addition to the main ALU, there is a parallel logic unit (PLU) that executes 
logic operations on data without affecting the contents of the accumulator. The 
PLU provides the bit-manipulation ability required of a high-speed controller 
and simplifies the bit setting, clearing, and testing required with control and sta- 
tus register operations. 


The multiplier performs 16 x 16-bit 2s-complement multiplication with a 
32-bit result in a single-instruction cycle. The multiplier consists of three ele- 
ments: multiplier array, PREG (product register), and TREGO (temporary reg- 
ister). The 16-bit TREGO temporarily stores the multiplicand; the PREG stores 
the 32-bit product. The multiplier’s values come from data memory, come from 
program memory when the MAC/MACD/MADS/MADD instructions are used, 
or are derived immediately from the multiply immediate instructions (MPY #). 
The fast on-chip multiplier allows the device to efficiently perform fundamental 
DSP operations such as convolution, correlation, and filtering. 


The ’C5x scaling shifter has a 16-bit input connected to the data bus anda 
32-bit output connected to the ALU. The scaling shifter produces a left shift 
of 0 to 16 bits on the input data, as programmed in the instruction or defined 
in the shift count register (TREG1). The LSBs of the output are filled with zeros, 
while the MSBs may be either zero-filled or sign-extended, depending upon 
the state of the sign-extension mode bit (SXM) of status register ST1. Addition- 
al shift capabilities enable the processor to perform numerical-scaling, bit-ex- 
traction, extended-arithmetic, and overflow-prevention operations. 


Eight levels of hardware stack save the contents of the program counter dur- 
ing interrupts and subroutine calls. On interrupts, the strategic registers (ACC, 
ACCB, ARCR, INDX, PMST, PREG, STO, ST1, TREGs) are pushed onto a 
one-deep stack and popped upon interrupt return, thus providing a zero-over- 
head interrupt context switch. 
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3.2 Functional Block Diagram 


The functional block diagram, shown in Figure 3—1, outlines the principal 
blocks and data paths within the ‘C5x processors. Further details of the func- 
tional blocks are provided in the succeeding sections. Refer to Section 3.3, /n- 
ternal Hardware Summary, for definitions of the symbols used in Figure 3—1. 


The ’C5x architecture is built around two major buses: the program bus and 
the data bus. The program bus carries the instruction code and immediate op- 
erands from program memory. The data bus interconnects various elements, 
such as the central arithmetic logic unit (CALU) and the auxiliary register file, 
to the data memory. Together, the program and data buses can carry data from 
on-chip data memory and internal or external program memory to the multipli- 
er in a single cycle for multiply/accumulate operations. 


The ’C5x possesses a high degree of parallelism; that is, while the data is be- 
ing operated upon by the CALU, arithmetic operations may also be executed 
in the auxiliary register arithmetic unit (ARAU). Such parallelism results in a 
powerful set of arithmetic, logic, and bit-manipulation operations that may all 
be performed in a single machine cycle. 


Functional Block Diagram 


RIgUie 3—1. Block Diagram of ’C5x Internal Hardware 
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3.3 Internal Hardware Summary 


The internal hardware of the 'C5x executes functions that other processors 
typically implement in software or microcode. For example, the device con- 
tains hardware for single-cycle 16 x 16-bit multiplication, data shifting, and 
address manipulation. This hardware-intensive approach provides computing 
power previously unavailable on a single chip. 


Table 3—1 presents a summary of the ’C5x’s internal hardware. This summary 
table, which includes the internal processing elements, registers, and buses, 
is alphabetized. All of the symbols used in the table correspond to the symbols 
used in Figure 3—1, the succeeding block diagrams in this chapter, and the text 
throughout this document. 


Table 3-1. ’C5x Internal Hardware Summary 


Accumulator ACC(32) A 32-bit accumulator accessible in two halves: ACCH (accumulator high) 
ACCH(16) and ACCL (accumulator low). Used to store the output of the ALU. See sub- 
section 3.5.2 for more information. 


ACCL(16) 


Accumulator Buffer A register used to temporarily store the 32-bit contents of the accumulator. 
This register has a direct path back to the ALU and therefore can be arithmet- 
ically or logically acted upon with the ACC. See subsection 3.5.2 for more 


information. 


ACCB(32) 


Arithmetic Logic Unit A 32-bit 2s-complement arithmetic logic unit having two 32-bit input ports 
and one 32-bit output port feeding the accumulator. See subsection 3.5.2 for 


more information. 


Auxiliary Register 
Arithmetic Unit 


Auxiliary Register ARCR(16) 
Compare 


Auxiliary Register File AUXREGS 
Auxiliary Register Buffer ARB(3) 
Auxiliary Register Pointer ARP(3) 
Block Move Address BMAR(16) 
Register 


Block Repeat BRAF(1) 
Active Flag 


Block Repeat Address PAER(16) 
End Register 


An unsigned 16-bit arithmetic unit used to calculate indirect addresses using 
the auxiliary, index, and compare registers as inputs. See subsection 3.4.3 
for more information. 


A 16-bit register used as a limit to compare indirect address against. See 
subsection 3.4.3 for more information. 


A register file containing eight 16-bit auxiliary registers (ARO—AR7) used for 
indirect data address pointers, temporary storage, or integer arithmetic pro- 
cessing through the ARAU. See subsection 3.4.3 for more information. 


A3-bit register that holds the previous value contained in the ARP. These bits 
are stored in ST1. See subsection 3.4.3 for more information. 


A3-bit register used as a pointer to the currently selected auxiliary register. 
These bits are stored in STO. See subsection 3.4.3 for more information. 


A 16-bit register that holds an address value for use with block moves or mul- 
tiply/accumulates. See subsection 3.4.2 for more information. 


A 1-bit flag indicating that a block repeat is currently active. This bitis normal- | 
ly set when the RPTB instruction is executed and cleared when the BRCR 
register decrements below zero. This bit resides in the PMST register. See 
subsection 3.6.5 for more details. 


A 16-bit memory-mapped register containing the end address of the seg- 
ment of code being repeated. See subsection 3.6.5 for more details. 
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Table 3—1.’C5x Internal Hardware (Continued) 


Unit Symbol Function 

Block Repeat Address PASR(16) | 
| Start Register 
| Block Repeat | BRCR(16) 

Counter Register 

Bus Interface Module | BIM | A buffered interface used to pass data between the internal data and pro- | 

gram buses. 

Bus Request | This signal indicates that a data access is mapped to global memory space 
| as defined by the GREG register. See Section 6.4 for more details. | 
This bit stores the carry output of the ALU. This bit resides in ST1. See sub- 
| Central Arithmetic Logic CALU 

Unit 


Circular Buffer CBCR(8) 
Control Register | 


A 16-bit memory-mapped register containing the start address of the seg- 
ment of code being repeated. See subsection 3.6.5 for more details. | 


A 16-bit memory-mapped counter register used to limit the number of times 
the block is to be repeated. See subsection 3.6.5 for more details. 


section 3.5.2 for more information. 


The grouping of the ALU, multiplier, accumulator, and scaling shifters. See | 
Section 3.5 for more information. 


An 8-bit register used to enable/disable the circular buffers and define which 
auxiliary registers are mapped to the circular buffers. See subsection 3.4.3 | 
for more information. 


Circular Buffer CBER(16) Two 16-bit registers indicating circular buffer end addresses. CBER1 and | 
End Address CBER1(16) | CBER2 are associated with circular buffers one and two, respectively. See | 
7 CBER2(16) | subsection 3.4.3 for more information. 


Two 16-bit registers indicating circular buffer start addresses. CBSR1 and 
CBSR2 are associated with circular buffers one and two, respectively. See 
subsection 3.4.3 for more information. 


CBSR(16) 
CBSRi1(16) 


CBSR2(16) 
This bit indicates whether on-chip dual-access RAM blocks are mapped to | 


‘| COMPARE 

ConfigureRAM | CNF 
program or data space. The CNF bit resides in ST1. See subsection 3.6.3 | 
for more information. : 


Circular Buffer 
Start Address 


This circuit compares the current value in the PC to the value in PAER if 
BRAF is active. If the compare shows equal, then the PASR is loaded into 
the PC. See subsection 3.4.3 for more information. | 


Compare of Program 
Address 


DATA _|_ A 16-bit bus used to route data. ian) 
Data Memory DATA This block refers to data memory used with the core and defined in specific 
MEMORY =| device descriptions. It refers to both on- and off-chip memory blocks in data | 
memory space. 
Data Memory | DATA A 16-bit bus that carries the address for data memory accesses. __ 
Address Bus ADDRESS 


Data Memory Address dma(7) A7-bit register containing the immediate relative address within a 128-word 
Immediate Register 


data page. See subsection 3.4.2 for more information. 
Data Memory DP(9) 
Page Pointer 
Data RAM Map Bit ~RAM(1) | 


Direct Data Memory | DRB(16) 
Address Bus 
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A 9-bit register containing the address of the current page. Data pages are 
128 words each, resulting in 512 pages of addressable data memory space 
(some locations are reserved). See subsection 3.4.2 for more information. 


This bit indicates if the single-access RAM is mapped into data space. See 
subsection 3.6.3 for more information. 


A 16-bit bus that carries the direct address for the data memory, which is the | 
concatenation of the DP register and the seven LSBs of the instruction 
(DMA). See subsection 3.4.2 for more information. 
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Table 3—1.’C5x Internal Hardware (Continued) 


A 16-bit memory-mapped register used as a mask input to the PLU in the ab- 
sence of a long immediate value. See Section 3.7 for more information. 


Dynamic Bit DBMR(16) 


Manipulation Register 


Dynamic Bit Pointer TREG2(4) A 4-bit register that holds a dynamic bit pointer for the BITT instruction. See 


Section 4.3 for more information. 


Dynamic Shift Count TREG1 (5) 


External Flag XF(1) 


A 5-bit register that holds a dynamic prescaling shift count for data inputs to 
the ALU. See Section 4.3 for more information. 


This bit drives the level of the external flag pin and resides in ST1. See sub- 
section 3.6.3 for more information. 


Global Memory GREG(8) An 8-bit memory-mapped register for specifying the size of the global 
Allocation Register memory space. See Section 6.4 for more details. 


| Hold Mode HM(1) This bit resides in ST1 and determines whether the CALU will stop or contin- 
ue when the HOLD signal initiates a power-down mode. See Section 6.6 for 
more information. 


Index Register INDX(16) This 16-bit memory-mapped register specifies increment sizes greater than 
1 for indirect addressing updates. In bit-reversed addressing, the index reg- 


ister defines the array size. See subsection 3.4.3 for more information. 


This bit determines whether a modification or write to ARO writes also to 
INDX and ARCR to maintain compatibility with the C25. This bit resides in 
the PMST register. See subsection 3.4.3 for more information. 


Index Register Enable 


Interrupt Flag Register IFR(16) 


Interrupt Mask Bit INTM(1) 


Interrupt Number INT#(4) 


NDX(1) 


A 16-bit flag register used to latch the active-low interrupts. The IFR is a me- 
mory-mapped register. See Section 3.8 for more information. 


The interrupt mask bit globally masks or enables all interrupts. This bit re- 
sides in STO. See Section 3.8 for more information. 


The number of the specific interrupt being sent to the CPU to be activated. 
This value comes from either the interrupt-processing Circuitry or, in the case 
of the INTR instruction, the program bus. See Section 3.8 for more informa- 
tion. 


Interrupt Pointer IPTR(5) Five bits pointing to the 2K page where the interrupt vectors currently reside 
in the system. These bits reside in the PMST register. See Section 3.8 for 


more information. 


Interrupt Mask Register IMR(16) A 16-bit memory-mapped register used to mask interrupts. See Section 3.8 
for more information. 


Microcail Stack MCS (15-0) Asingle-word stack that temporarily stores the contents of the PFC while the 

PFC is being used to address data memory with the block move (BLDD/ 
BLPD), multiply-accumulate (MAC/MACD), and table read/write (TBLR/ | 
TBLW) instructions. 


This bit resides in the PMST register and indicates whether the on-chip ROM 
is mapped into program address space. See subsection 3.6.3 for more infor- 
mation. 


| Microprocessor/ 
Microcomputer Mode 


Multiplexer | MUX A bus multiplexer used to select the source of operands for a bus or execu- 
tion unit, depending on the nature of the current instruction. 


Multiplier MULTIPLIER A 16 x 16-bit parallel multiplier. See subsection 3.6.3 for more information. 


wine | Syimpoi Function 


Overflow Flag | OV(1) 
Overflow Mode OVM(1) 


Overlay to Data Space OVLY (1) 


This bit resides in STO and indicates an overflow in an arithmetic operation | 
in the ALU. See subsection 3.6.3 for more information. 


This bit resides in STO and determines whether an overflow in the ALU will | 
wrap around or saturate. See subsection 3.6.3 for more information. 


This bit resides in the PMST register and determines whether the on-chip 
single-access memory will be addressable in data address space. See sub- 
section 3.6.3 for more information. 


Parallel Logic Unit PLU A 16-bit logic unit that executes logic operations from either long immediate 
operands or the contents of the DBMR directly upon data locations without 
interfering with the contents of the CALU registers. See Section 3.7 for more 
information. 

Prefetch Counter PFC (15-0) | A 16-bit counter used to prefetch program instructions. The PFC contains 


the address of the instruction currently being prefetched. It is updated when 
a new prefetch is initiated. The PFC can also address program memory 
when the block move (BLPD), multiply-accumulate (MAC/MACD), and table 
read/write (TBLR/TBLW) instructions are used and can address data 
memory when the block move (BLDD) instruction is used. 


A four-bit register that contains the value for the prescaling operation. When | 
the register contents are used as prescaling data, this register is loaded from 
the dynamic shift count or from the instruction. In conjunction with the BIT 
and BITT instructions, this register is loaded from the dynamic bit pointer or 
the instruction word. 


Prescaler Count Register | COUNT(4) 


A 32-bit product register used to hold the multiplier’s product. The high and | 
low words of the PREG can be accessed individually. See subsection 3.5.3 
: for more information. 


Program Bus | PROG | A 16-bit bus used to route instructions (and data for the MAC and MACD in- 
| DATA structions). 


Program Counter PC(16) A 16-bit program counter used to address program memory sequentially. 
The PC always contains the address of the next instruction to be fetched. 
The PC contents are updated following each instruction decode operation. | 


Product Register PREG(32) 


Program Memory PROGRAM 
MEMORY 


This block refers to program memory used with the core and defined in spe- 
Cific device descriptions. It refers to both on- and off-chip memory blocks ac- 
cessed in program memory space. 


Program Memory PROG | | A 16-bit bus that carries the program memory address. | 
| Address Bus ADDRESS 

Prescaling Shifter | PRESCALER | AO- to 16-bit left barrel shifter used to prescale data coming into the ALU. 

, | Also used to align data for multiprecision operations. This shifter is also used 
| | asa0-to 16-bit right barrel shifter of the ACC. See subsection 3.5.2 for more 
information. 
Postscaling Shifter POST- A0- to 7-bit left barrel shifter used to postscale data coming out of the CALU. 
SCALER See subsection 3.5.2 for more information. 


Product Shifter P-SCALER AO-, 1-, or 4-bit left shifter that can remove extra sign bits (gained in the multi- | 


ply operation) when fixed-point arithmetic is used; or a 6-bit right shifter that 


can scale the products down to avoid overflow in the accumulation process. 
See subsection 3.5.3 for more information. 
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Table 3-1. ’C5x Internal Hardware (Continued) 


Product Shifter Mode PM(2) These two bits define the product shifter mode; They reside in ST1. See sub- 


section 3.6.3 for more information. 
Sign Extension Mode | SXM(1) 
a 
Status Registers STO, ST1, 
PMST 
Temporary Multiplicand TREGO(16) 


Temporary Registers En- TRM(1) 
able 


Test/Control Flag TC(1) | 


A 16-bit counter used to control the repeated execution of a single instruc- 
tion. See subsection 3.6.4 for more information. | 


This bit resides in ST1 and controls whether the arithmetic operation will be 
sign-extended or not. See subsection 3.6.3 for more information. | 


An 8 x 16-bit hardware stack used to store the PC during interrupts and calls. 
The ACCL and data memory values may also be pushed onto and popped 
from the stack. See Section 3.8 for more information. 


Three 16-bit status registers that contain status and control bits. See subsec- 
tion 3.6.3 for more information. 


A 16-bit register that temporarily holds an operand forthe multiplier. See sub- 
section 3.5.3 for more information. 


This bit defines whether an LT(A,D,PS) instruction loads ail three of the 
TREGs(0,1,2) to maintain compatibility with the "C25 or loads just TREGO. 
This bit resides in the PMST register. See subsection 3.6.3 for more informa- 
tion. 


This bit resides in ST1 and stores the results of ALU or PLU test bit opera- | 
tions. See subsection 3.6.3 for more information. | 


3.4 


3.4.1 


Internal Memory Organization 


This section describes the memory use of the ’C5x core and the addressing 
modes supported by the core. 


Core Processor Memory-Mapped Registers 


Twenty-eight core processor registers are mapped into the data memory 
space. These are listed in Table 3—2. An additional 64 memory-mapped regis- 
ters are reserved in page 0 of data space. These data memory locations are 
reserved for memory-mapped peripheral control and I/O port registers. 


Table 3-2. Core Processor Memory-Mapped Registers 
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Description 


"C5x ’C5x 
Dec Hex 
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Name 


Reserved 

Interrupt mask register 
Global memory allocation register 

Interrupt flag register 

Processor mode status register 

Repeat counter register 

Block repeat counter register 

Block repeat program address start register 
Block repeat program address end register 
Temporary register for multiplicand 
Temporary register for dynamic shift count 
Temporary register used as bit pointer 

in dynamic bit test 

Dynamic bit manipulation register 
Auxiliary register zero 

Auxiliary register one 
Auxiliary register two 
Auxiliary register three 
Auxiliary register four 
Auxiliary register five 
Auxiliary register six 
Auxiliary register seven 

Index register 
Auxiliary register compare register 

Circular buffer 1 start address register 
Circular buffer 1 end address register 
Circular buffer 2 start address register 
Circular buffer 2 end address register 
Circular buffer control register 
Block move address register 


MUOWPOONOoA 


Memory-mapped peripheral registers. See | 
Table 5-1. : 


80-95 | 50-5F | Memory-mapped I/O port. See Table 5-1. 
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3.4.2 Memory Addressing Modes 


The ’C5x can address a total of 64K words of program memory and 96K words 
of data memory. Chapter 6 shows how the on-chip program and data memo- 
ries are mapped. 


The data used as instruction operands is obtained in one of the following eight 
ways: | 


By the direct address bus (DRB) using the direct addressing mode (e.g., 
ADD 010h) relative to the data memory page pointer (DP) 

By the DRB using the memory-mapped addressing mode (that is, LAMM 
PMST) within data page zero 

By the auxiliary register file bus (AFB) using the indirect addressing mode 
(that is, ADD *) 

By the instruction register (IREG) in short immediate operand mode (that 
is, ADD #0FFh) 

By the program counter (PC) in long immediate operand mode (that is, 
ADD #0FFFFh) 

By the core CPU access of a register in register access mode (that is, APL 
*+ or MPY *+) 

By the second instruction word in long immediate address mode (that is, 
BLDD #4TBL1,*+) 

By the block memory address register (BMAR) in registered block memory 
addressing mode (that is, BLDD *+) 


ou ooo oO Oo 


In the direct addressing mode, the 9-bit DP points to one of 512 pages (1 page 
= 128 words). The data memory address (dma), specified by the seven LSBs 
of the instruction, points to the desired word within the page. The address on 
the DRB is formed by concatenating the 9-bit DP with the 7-bit dma. Figure 3-2 
illustrates direct addressing mode. In the illustration, the operand is fetched 
from data memory space via the data bus, and the address is the concatenated 
value of the DP and the seven LSBs of the instruction. Note that bit 7=0 defines 
the addressing mode as direct. 
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Figure 3-2. Direct Addressing Mode 
ADD 010h 


Machine Code = 


0010 SHFTt 101001 oOo000 


/ / 


DP = 1100 1110 1 / / 
/ / 
\ \ 7 / 
DRB = 1100 1110 11/0 0 1 0000 
Operand = Data(DRB) 


t SHFT represents a 4-bit shift value. 


Memory-mapped addressing mode operates much like direct addressing 
mode except that the most significant 9 bits of the address are forced to zero 
instead of being loaded with the contents of the DP. This allows the user to di- 
rectly address the memory-mapped registers of data page zero without the 
overhead of changing the DP or auxiliary register. Figure 3-3 illustrates 
memory-mapped addressing mode. 


Figure 3—3. Memory-Mapped Addressing Mode 


LAMM PMST 
Machine Code = 0000 1000 Toro 0 0 O11 7 
/ / 
pp = [0000 0000 0 y H 
BS Net / 
DRB = 0000 0000 010 0 0 O11 1] 
Operand = Data(DRB) 
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Internal Memory Organization 


In the indirect addressing mode, the currently selected 16-bit auxiliary register 
AR(ARP) addresses the data memory through the auxiliary register file bus 
(AFB). While the selected auxiliary register provides the data memory address 
and the data is being manipulated by the CALU, the contents of the auxiliary 
register may be manipulated through the ARAU. See Figure 3—4 for an exam- 
ple of indirect auxiliary register addressing. Also, bit 7=1 defines this address- 
ing mode as indirect. 


Figure 3—4. Indirect Addressing Mode 


ADD - 


Machine Code = 0010 SHFT {11/000 0000 


ARP =|0 1 1/-®AR3 =|1 100 1110 1001 #0000 


Operand =  Data(AR(ARP)) 


The operand may reside as part of the instruction machine code. In the case 
of the short immediate operand, the operand is contained in the single-word 
instruction. These short immediate operands vary in length from 1 bit on the 
SETC instruction to 13 bits on the MPY instruction. Figure 3—5 shows an ex- 
ample of short immediate mode. Note that, in this example, the lower eight bits 
are the operand and will be added to the ACC by the CALU. 


Figure 3—5. Short Immediate Mode 


ADD ~~ #0FFh 
MachineCode = [1011 1000 {1111 #1411411 
Operand = |1 11 1 11141 
Operand = Data(ADD(7 — 0)) 
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In the case of the long immediate operand, the operand immediately follows 
the opcode in the program sequence. The long immediate operand is 16 bits 
long. Figure 3—6 shows an example of long immediate mode. In this example, 
the second word of the two-word instruction is added to the ACC by the CALU. 


Figure 3—6. Long Immediate Mode 


ADD #01234h 


Machine Code 


Operand 


Operand 


O00T 00710 OOTT O17 0 Ob 


= Data(second word(15 — 0)) 


The operand may come from a CPU register. This type of operand is used in 
special cases. The CALU uses this in multiplying with TREGO, in shifting with 
TREG1 and PM, and in bit manipulation with TREG2. The ARAU uses this with 
INDX and ARCR. The PLU uses this with DBMR. Figure 3-7 illustrates the use 
of the DBMR register as an AND mask in the APL instruction. 


Figure 3—7. Register Access Mode 


APL 010h 
Machine Code = 001 0000 
| ye ca 
OP f / 
\ ‘ ir ye 


DRB = [1100 1110 1/001 o000 


Operand1 
Operand2 


=  Data(DRB) 
= f{DBMR 


In the long immediate addressing mode, an operand is addressed by the sec- 
ond word of a two-word instruction. In this case, the program address/data bus 
(PAB) is used for the operand fetch. The PC is stored in a temporary register, 
and the long immediate value is loaded into the PC. Then, the PAB is used for 
the operand fetch or write. At the completion of the instruction, the PC is re- 
stored from the temporary register, and execution continues. This technique 
is used when two memory addresses are required for the execution of the in- 
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struction. The PC is used so that, when an instruction is repeated, the address 
generated can be autoincremented. Figure 3-8 illustrates this mode. In this 
illustration, the source address (OPERAND‘1) is fetched via PAB, and the des- 
tination address (OPERAND2) uses the direct addressing mode. 


Figure 3—8. Long Immediate Addressing Mode 


BLDD #02345h, 012h 


Machine Code 1 = _ 001 0 
7 
/ 
/ Je 
pP= {1100 1110 1 J y 
\ s. / / 
\ XQ 7 / 
DRB = 11100 1110 001 0010 
Operand2 = Data (DRB) 
MachineCode2 = 10010 0011 0100 0101 
une a 
PC = 0010 0011 0100 0101 
Operand1 = Data (PC) 


Registered block memory addressing mode operates like the long immediate 
addressing mode with the exception that the address comes from BMAR. The 
advantage of this technique is that the address of the block of memory to be 
acted upon can be changed during execution of the program. The address in 
long immediate addressing mode resides in the program flow and cannot be 
easily changed. Figure 3—9 shows an example of registered block memory ad- 
dressing mode. 
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Figure 3—9. Registered Block Memory Addressing Mode 
BLDD BMAR, 012h 


Machine Code 1 = 1010 1100 0/001 0010 
/ 
a / ts 
pp = [41100 i110 i 1/ / 
\ / / 
DRB = 1100 1110 #11001 0010 
Operand2 = Data (DRB) 
EMAAR)» [FC] 
Operand1 = Data (PC) 


3.4.3 Auxiliary Registers 


The ’C5x provides a register file containing eight auxiliary registers 
(ARO—-AR7). The auxiliary registers may be used for indirect addressing of the 
data memory or for temporary data storage. Indirect auxiliary register address- 
ing (see Figure 3—10) allows placement of the data memory address of an in- 
struction operand into one of the auxiliary registers. These registers are 
pointed to by a three-bit auxiliary register pointer (ARP) that is loaded with a 
value from 0 through 7, designating ARO through AR7, respectively. The auxil- 
lary registers and the ARP may be loaded from data memory, the accumulator, 
the product register, or by an immediate operand defined in the instruction. 
The contents of these registers may also be stored in data memory or used 
as inputs to the CALU. These registers appear in the memory map as de- 
scribed in Table 3—2. 
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Figure 3-10. Indirect Auxiliary Register Addressing Example 


Auxiliary Register File Data Memory Map 
ARO 053 7 h Location 
0000h 
Auxiliary Register a) S15 0h 
Pontet aR2 [0 E9 FCh 


(in STO) 


ARP [o[1[1]——ar3 [0 F F 3 A _h|——® OFF3Ah 
AR4 | 103 Bh OFFFFh 


ars [26 B1h 
AR6 000 8h 
AR7 8 43 Dh 


The auxiliary register file (ARO—AR7) is connected to the auxiliary register 
arithmetic unit (ARAU), shown in Figure 3-11. The ARAU may autoindex the 
current auxiliary register while the data memory location is being addressed. 
Indexing either by +1 or by the contents of the INDX register may be per- 
formed. As a result, accessing tables of information does not require the cen- 
tral arithmetic logic unit (CALU) for address manipulation; thus, the CALU is 
free for other operations in parallel. 


lf more advanced address manipulation is required, such as multidimensional 
array addressing, the CALU can directly read from or write to the auxiliary reg- 
isters. However, the ARAU update of the ARs is done during the decode phase 
(second cycle) of the pipeline, while the CALU write is done during the execu- 
tion phase (fourth cycle) of the pipeline. Therefore, the two instructions directly 
following the CALU write to an auxiliary register should not use the same auxil- 
lary register for address generation. See subsection 3.6.2 for details. 
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Figure 3-11. Auxiliary Register File 


8 LSB of Instruction Register 
(IREG) (16) 


Auxiliary 
Register 
Buffer 
(ARB) (3) | 


Auxiliary 
Register 
Pointer 
(ARP) (3) | 


) 
Auxiliary Register 2 (AR2) (16) 
Auxiliary Register 1 (AR1) (16) 
Auxiliary Register 0 (ARO) (16) 


3 


AEX 


YvyVvVVVYVvyYyVYys 


16 


INB OUT IN A 


Auxiliary Register Arithmetic Unit (ARAU) (16) | % ee 


Register 


Auxiliary Register File Bus (AFB) 3, : (IREG) (16) 


16 


As shown in Figure 3—11, the index register, compare register, or the eight 
LSBs of the instruction register can be used as one of the inputs of the ARAU. 
The other input is fed by the current AR (being pointed to by ARP). AR(ARP) 
refers to the contents of the current AR pointed to by ARP. The ARAU performs 
the functions shown in Table 3-3. 
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Table 3—3. Auxiliary Register Arithmetic Unit Functions | 


| Auxillary Register Operation Description 
AR(ARP) + INDX — AR(ARP) Index the current AR by adding an unsigned 16-bit in- 
teger contained in INDX. Example: ADD *0+ 
AR(ARP) — INDX — AR(ARP) Index the current AR by subtracting an unsigned 
16-bit integer contained in INDX. Example: ADD *0— 


AR(ARP) + 1 — AR(ARP) 


Increment the current AR by one. Example: ADD *+ 


EARN NR NW TR Po a 


| “AR(ARP) — 1 —> AR(ARP) _ —_ | Decrement the current AR by one. Example: + 
AR(ARP) -> AR(ARP) | Do not modify the current AR. Example: ADD * 


| AR(ARP) + IR(7—0) — AR(ARP) Add an 8-bit immediate value to current AR Example | 
| ADDRK *55h 

| Subtract an 8-bit immediate value from current AR. | 

Example: SBRK *55h : 


| AR(ARP) + rc(INDX) > AR(ARP) Bit-reversed indexing; add INDX with reversed-carry | 
(rc) propagation. Example: ADD *BRO+ 


Bit-reversed indexing; subtract INDX with re- 
versed-carry (rc) propagation. Example: ADD *BRO— 


‘Compare the current AR to ARCR and, if condition is 
true, then set TC bit of the status register ST1 to one. 
If false, then clear the TC bit. Example: CMPR 3 : 


[ AR(ARP) —IR(7-0) > AR(ARP) 


AR(ARP) — rc(INDX) > AR(ARP) 


If (AR(ARP)) = (ARCR) then TC = 1 
If (AR(ARP)) < (ARCR) then TC = 1 
If (AR(ARP)) > (ARCR) then TC = 1 
If (AR(ARP)) « (ARCR) then TC = 1 


i (AR(ARP)) = (CBER) then AR(ARP) = CBSR 


If at end of circular buffer, reload start address. The 
| test for this condition is done prior to the execution of 
| the auxiliary register modification. Example: ADD *+ 


The index register (INDX) can be added to or subtracted from AR(ARP) on any 
AR update cycle. This 16-bit register is one of the memory-mapped registers 
and is used to increment or decrement the address in steps larger than one, 
which is useful for operations such as addressing down a column of a matrix. 
The auxiliary register compare register (ARCR) is used as a limit to blocks of 
data and, in conjunction with the CMPR instruction, supports logical compari- 
sons between AR(ARP) and ARCR. Note that the C25 uses ARO for these two 
functions. After reset, a LAR load of ARO also loads INDX and ARCR to main- 
tain compatibility with the ‘C25. The splitting of functions to the three registers 
is enabled by setting the NDX bit of PMST to one. 


Because the auxiliary registers are memory-mapped, they can be acted upon 
directly by the CALU to provide for more advanced indirect addressing tech- 
niques. For example, the multiplier can be used to calculate the addresses of 
three-dimensional matrices. After a CALU load of the auxiliary register, there 
is, however, a two-instruction-cycle delay before auxiliary registers can be 
used for address generation. The INDX and ARCR registers are accessible 
via the CALU, regardless of the condition of the NDX bit (i.e., SAMM ARCR 
writes only to the ARCR). 
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In addition to its use for address manipulation in parallel with other operations, 
the ARAU may also serve as an additional general-purpose arithmetic unit be- 
cause the auxiliary register file can directly communicate with data memory. 
The ARAU implements 16-bit unsigned arithmetic, whereas the CALU imple- 
ments 32-bit 2s-complement arithmetic. The BANZ and BANZD instructions 
permit the auxiliary registers to be used as loop counters. 


The 3-bit auxiliary register pointer buffer (ARB), shown in Figure 3-11, provides 
storage for the ARP on subroutine calls when the automatic context switch fea- 
ture of the device are not used. 


Two circular buffers can operate at a given time and are controlled via the cir- 
cular buffer control register (CBCR). The CBCR is defined as shown in 
Table 3—4. 


Table 3—4. Circular Buffer Control Register (CBCR) 


Identifies which auxiliary register is mapped to circular buffer 1. 


Circular buffer 1 enable=1/disable=0. Set to O upon reset. 


CAR2 Identifies which auxiliary register is mapped to circular buffer 2 
Circular buffer 2 enable=1/disable=0. Set to 0 upon reset. 


Upon reset (RS rising edge), both circular buffers are disabled. To define a cir- 
cular buffer, load the CBSR1/2 with the start address of the buffer and 
CBER1/2 with the end address, and load the auxiliary register to be used with 
the buffer with an address between the start and end addresses. Finally, load 
CBCR with the appropriate auxiliary register number and set the enable bit. 
Note that the same auxiliary register can not be enabled for both circular buff- 
ers, or unexpected results will occur. As the address is stepping through the 
circular buffer, the auxiliary register value is compared against the value con- 
tained in CBER prior to the update to the auxiliary register value. If the current 
auxiliary register value and the CBER are equal and an auxiliary register modi- 
fication occurs, the value contained in CBSR is automatically loaded into the 
AR. If the values in the CBER and the auxiliary register are not equal, the auxil- 
lary register is modified as specified. 


Circular buffers can be used with either increment- or decrement-type up- 
dates. If increment is used, then the value in CBER must be greater than the 
value in CBSR. If decrement is used, the value in CBER must be less than the 
value in CBSR. The other indirect addressing modes can be used; however, 
the ARAU tests only for the condition AR(ARP) = CBER. The ARAU will not 
detect an AR update that steps over the value contained in CBER. See sub- 
section 4.1.6 for further details. | 


3.4.4 Memory-to-Memory Moves 


The ’C5x provides instructions for data and program block moves and for data 
move functions that efficiently utilize the memory spaces of the device. 
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The BLDD instruction moves a block within data memory, the BLPD instruction 
moves a block from program memory to data memory, and the BLDP instruc- 
tion moves a block from data memory to program memory. One of the address- 
es of these instructions comes from the data address generator, while the oth- 
er comes either from along immediate constant or from the BMAR. When used 
with the repeat instructions (RPT and RPTZ), these instructions efficiently per- 
form block moves from on-chip or off-chip memory. 


Implemented in on-chip data RAM, the DMOV (data move) function is equiva- 
lent to that of the ‘C25. DMOV copies a word from the currentiy addressed data 
memory location in on-chip RAM to the next-higher location, while the data 
from the addressed location is being operated upon in the same cycle (e.g., 
by the CALU). An ARAU operation may also be performed in the same cycle 
when the indirect addressing mode is used. The DMOV function can imple- 
ment algorithms that use the z—! delay operation, such as convolution and dig- 
ital filtering, where data is being passed through atime window. The data move 
function is at its highest efficiency when operating in dual-access on-chip 
memory. When operating in single-access memory, it requires an additional 
cycle. It is contiguous across the boundary of blocks BO and B1. The MACD 
(multiply and accumulate with data move), MADD (multiply and accumulate 
with data move and coefficient address contained in BMAR), DMOV (data 
move) and LTD (load TREGO with data move and accumulate product) instruc- 
tions make use of the data move function. 


Aa a a ec | 
Note: 


The data move operation cannot be performed on external data memory. 


The TBLR/TBLW (table read/write) instructions transfer words between pro- 
gram and data spaces. TBLR reads words from program memory into data 
memory. TBLW writes words from data memory to program memory. 
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3.5 Central Arithmetic Logic Unit (CALU) 
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The ’C5x central arithmetic logic unit (CALU) contains a 16-bit scaling shifter, 
a16 x 16-bit parallel multiplier, a 32-bit arithmetic logic unit (ALU), a 32-bit 
accumulator (ACC), a 32-bit accumulator buffer (ACCB), and additional shift- 
ers at the outputs of both the accumulator and the multiplier. This section de- 
scribes the CALU components and their functions. Figure 3—12 is a block dia- 
gram showing the components of the CALU. The following steps occur in the 
implementation of a typical ALU instruction: 


1) Data is fetched from memory on the data bus, 

2) Datais passed through the scaling shifter and the ALU where the arithme- 
tic is performed, and 

3) The result is moved into the accumulator. 


One input to the ALU is always provided by the accumulator. The other input 
may be transferred from the product register (PREG) of the multiplier, the ac- 
cumulator buffer (ACCB), or the scaling shifter that is loaded from data 
memory or the accumulator (ACC). 


Architecture 


Central Arithmetic Logic Unit (CALU) 


Figure 3—12. Central Arithmetic Logic Unit 
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3.5.1 Scaling Shifter 


The ’C5x provides a scaling shifter that has a 16-bit input connected to the data 
bus and a 32-bit output connected to the ALU; see Figure 3—12. The scaling 
shifter produces a left shift of 0 to 16 bits on the input data. The shift count is 
specified by a constant embedded in the instruction word or by the value in 
TREG1. The LSBs of the output are filled with zeros; the MSBs may be either 
filled with zeros or sign-extended, depending upon the value of the SXM bit 
(sign-extension mode) of status register ST1. 
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The ’C5x also contains several other shifters that allow it to perform numerical 
scaling, bit extraction, extended-precision arithmetic, and overflow preven- 
tion. These shifters are connected to the output of the product register and the 
accumulator. 


3.5.2 ALU and Accumulator 
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The ’C5x 32-bit ALU and accumulator implement a wide range of arithmetic 
and logical functions, the majority of which execute in a single clock cycle. 
Once an operation is performed in the ALU, the result is transferred to the ac- 
cumulator where additional operations, such as shifting, may occur. Data that 
is input to the ALU may be scaled by the scaling shifter. 


The ALU is a general-purpose arithmetic/logic unit that operates on 16-bit 
words taken from data memory or derived from immediate instructions. In ad- 
dition to the usual arithmetic instructions, the ALU can perform Boolean opera- 
tions, facilitating the bit manipulation ability required of a high-speed controller. 
One input to the ALU is always supplied by the accumulator, and the other in- 
put may be furnished from the product register (PREG) of the multiplier, the 
accumulator buffer (ACCB), or the output of the scaling shifter (that has been 
read from data memory or from the ACC). After the ALU has performed the 
arithmetic or logical operation, the result is stored in the accumulator. For the 
following example, assume ACC = 0, PREG = 000222200h, PM = 00, and 
ACCB = 000333300h: 


LACC #01111h,8 ;ACC = 00111100. Load ACC from prescaling 


:shifter 

APAC sACC = 00333300. Add to ACC the 
;product register. 

ADDB ;ACC = 00666600. Add to ACC the 


saccumulator buffer. 


The 32-bit accumulator (ACC) can be split into two 16-bit segments for storage 
in data memory; see Figure 3—12. Shifters at the output of the accumulator 
provide a left shift of 0 to 7 places. This shift is performed while the data is being 
transferred to the data bus for storage. The contents of the accumulator re- 
main unchanged. When the postscaling shifter is used on the high word of the 
accumulator (bits 16 — 31), the MSBs are lost and the LSBs are filled with bits 
shifted in from the low word (bits 0 — 15). When the postscaling shifter is used 
on the low word, the LSBs are zero-filled. For the following example, assume 
ACC = OFF234567h: 


SACL TEMP1,7 ;TEMP1 = 0B380 ACC 
SACH TEMP2,7 ;TEMP2 = 91A2 ACC 


OFF234567. 
OFF234567. 


The ’C5x supports floating-point operations for applications requiring a large 
dynamic range. By performing left shifts, the NORM (normalization) instruction 
is used to normalize fixed-point numbers contained in the accumulator. The 
four bits of the TREG1 define a variable shift through the scaling shifter for the 
ADDT/LACT/SUBT instructions (add to / load to / subtract from accumulator 
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with shift specified by TREG1). These instructions are useful in denormalizing 
a number (converting from floating-point to fixed-point). They are also useful 
in execution of an automatic gain control (AGC) going into a filter. 


The single-cycle 1-bit to 16-bit right shift of the accumulator can efficiently align 
the accumulator’s contents. This, coupled with the 32-bit temporary buffer on 
the accumulator, enhances the effectiveness of the CALU in extended-preci- 
sion arithmetic. The accumulator buffer register (ACCB) provides a temporary 
storage place for a fast save of the accumulator. The ACCB can also be used 
as an input to the ALU. The minimum or maximum value in a string of numbers 
can be found by comparing the contents of the ACCB with the contents of the 
ACC. The minimum or maximum value is placed in both registers, and, if the 
condition is met, the carry bit (C) is set to 1. The minimum and maximum func- 
tions are executed by the CRLT and CRGT instructions, respectively. These 
operations are signed arithmetic operations. For the following examples, as- 
sume ACC=012345678h and ACCB= 07654321 0h: 


CRLT ;ACC = ACCB = 12345678. C = 1. 


CRGT ;ACC = ACCB = 76543210. C = 0. 


The accumulator’s overflow saturation mode may be enabled/disabled by set- 
ting/resetting the OVM bit of STO. When the accumulator is in the overflow sat- 
uration mode and an overflow occurs, the overflow flag is set and the accumu- 
lator is loaded with either the most positive or the most negative value repre- 
sentable in the accumulator, depending upon the direction of the overflow. The 
value of the accumulator upon saturation is O7FFFFFFFh (positive) or 
080000000h (negative). If the OVM (overflow mode) status register bit is reset 
and an overflow occurs, the overflowed results are loaded into the accumula- 
tor without modification. Note that logical operations cannot result in overflow. 


The ’C5x can execute a variety of branch instructions that depend on the status 
of the ALU and the accumulator. For example, execution of the instruction 
BCND can depend on a variety of conditions in the ALU and the accumulator. 
The BACC instruction allows branching to an address stored in the accumula- 
tor. The bit test instructions (BITT and BIT) facilitate branching on the condition 
of a specified bit in data memory. 


The ’C5x accumulator also has an associated carry bit that is set or reset, de- 
pending on various operations within the device. The carry bit allows more effi- 
cient computation of extended-precision products and additions or subtrac- 
tions. It is quite useful in overflow management. The carry bit is affected by 
most arithmetic instructions as well as the single-bit shift and rotate instruc- 
tions. It is not affected by loading the accumulator, logical operations, or other 
such non-arithmetic or control instructions. Examples of carry bit operations 
are shown in Figure 3—13. 
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Figure 3—13. Examples of Carry Bit Operations 
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C MSB LSB C MSB LSB 
xX F F F F F F F F ACC Xx 00000000 ACC 
TT, ee ae | ee ee 
1 00000000 0 FFFFFFFF 
C MSB LSB C MSB LSB 
» 4 7 F F F F F F F ACC xX 80000001 ACC 
+ = se ee ee 
0 80000000 1 7FFFFFFF 
C MSB LSB C MSB LSB 
1 00000000 ACC 0 FF F F F F F F ACC 
eos PS OF ADDC): ee eee ee 6 
0 00000001 1 FFFFFFFd 


Shown in the examples of Figure 3-13, the value added to or subtracted from 
the accumulator may come from the input scaling shifter, ACCB, or PREG. The 
carry bit is set if the result of an addition or accumulation process generates 
a carry; it is reset to zero if the result of a subtraction generates a borrow. 
Otherwise, it is cleared after an addition or set after a subtraction. 


The ADDC (add to accumulator with carry) and SUBB (subtract from accumu- 
lator with borrow) instructions use the previous value of carry in their addition/ 
subtraction operation. The ADCB (add ACCB to accumulator with carry) and 
the SBBB (subtract ACCB from accumulator with borrow) also use the pre- 
vious value of carry. 


The one exception to operation of a carry bit, as shown in Figure 3-13, is in 
the use of ADD with a shift count of 16 (add to high accumulator) and SUB with 
a shift count of 16 (subtract from high accumulator). This case of the ADD in- 
struction can set the carry bit only if a carry is generated, and this case of the 
SUB instruction can reset the carry bit only if a borrow is generated; otherwise, 
neither instruction affects it. This feature is useful for extended precision arith- 
metic, as discussed in Chapter 7. 


Two conditional operands, C and NC, are provided for branching, calling, re- 
turning, and conditionally executing according to the status of the carry bit. 
The CLRC, LST #1, and SETC instructions can also be used to load the carry 
bit. The carry bit is set to one on a hardware reset. 


The SFL and SFR (in-place one-bit shift to the left/right) instructions and the 
ROL and ROR (rotate to the left/right) instructions shift or rotate the contents 
of the accumulator through the carry bit. The SXM bit affects the definition of 
the SFR (shift accumulator right) instruction. When SXM = 1, SFR performs 
an arithmetic right shift, maintaining the sign of the accumulator’s data. When 
SXM = 0, SFR performs a logical shift, shifting out the LSBs and shifting in a 
zero for the MSB. The SFL. (shift accumulator left) instruction is not affected 
by the SXM bit and behaves the same in both cases, shifting out the MSB and 
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shifting in a zero. The repeat (RPT and RPTZ) instructions may be used with 
the shift and rotate instructions for multiple-bit shifts. 


The SFLB, SFRB, RORB, and ROLB instructions can shift or rotate the 65-bit 
combination of the accumulator, ACCB, and carry bit as described above. 


The accumulator can also be right-shifted 0 — 31 bits in two instruction cycles 
or 1 — 16 bits in one cycle. The bits shifted out are lost, and the bits shifted in 
are either zeros or copies of the original sign bit, depending on the value of the 
SXM status bit. A shift count of 1 to 16 is embedded in the instruction word of 
the BSAR instruction. For example, let ACC = 012345678h: 


BSAR 7 ;ACC = 02468ACE. 


The right shift can also be controlled via TREG1. The SATL instruction shifts 
the ACC by 0-15 bits as defined by bits 3—0 of TREG1. The SATH instruction 
shifts the ACC 16 bits to the right if bit 4 of TREG1 is a 1. The following code 
sequence executes a 0- to 31-bit right shift of the ACC based on the shift count 
stored at SHFT. As an example, consider the value stored at SHFT = 01Bh and 
ACC = 012345678h: 


LMMR TREG1,SHFT *TREG1] = shift count 0 — 31. TREG1 = 1B 


SATH sIf shift count > 15, then ACC >> 16 
sACC = 00001234 
SATL ;ACC >> shift count. ACC = 00000002 


3.5.3 Multiplier, TREGO, and PREG 


The ’C5x uses a 16 x 16-bit hardware multiplier that is capable of computing 
a signed or an unsigned 32-bit product in a single machine cycle. All multiply 
instructions, except the MPYU (multiply unsigned) instruction, perform a 
signed multiply operation in the multiplier. That is, two numbers being multi- 
plied are treated as 2s-complement numbers, and the result is a 32-bit 
twos-complement number. As shown in Figure 3—12, the following two regis- 
ters are associated with the multiplier: 


[_} 16-bit temporary register (TREGO) that holds one of the operands for the 
multiplier, and 


[_} 32-bit product register (PREG) that holds the product. 


Four product shift modes (PM) are available at the PREG’s output. These shift 
modes are useful for performing multiply/accumulate operations, performing 
fractional arithmetic, or justifying fractional products. The PM field of status 
register ST1 specifies the PM shift mode, as shown in Table 3—5. 


Table 3—5. Product Shift Modes 


CPM | Resulting shit 
To Nosh 
0 
ai 


Left shift of 4 bits 


Right shift of 6 bits 
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The product is shifted one bit to compensate for the extra sign bit gained in mul- 
tiplying two 16-bit 2s-complement numbers (MPY). The four-bit shift is used 
in conjunction with the MPY instruction with a short immediate value (13 bits 
or less) to eliminate the four extra sign bits gained in multiplying a16-bit num- 
ber times a 13-bit number. The output of PREG can, instead, be right-shifted 
6 bits to enable the execution of up to 128 consecutive multiply/accumulates 
without the possibility of overflow. Note that, when the right shift is specified, 
the product is always sign-extended, regardless of the value of SXM. 


The LT (load TREGO) instruction normally loads TREGO to provide one oper- 
and (from the data bus), and the MPY (multiply) instruction provides the sec- 
ond operand (also from the data bus). A multiplication can also be performed 
with a short or long immediate operand by using the MPY instruction with an 
immediate operand. A product can be obtained every two cycles except when 
a long immediate operand is used. 


Four multiply/accumulate instructions (MAC, MACD, MADD, and MADS) fully 
utilize the computational bandwidth of the multiplier, allowing both operands 
to be processed simultaneously. The data for these operations can be trans- 
ferred to the multiplier each cycle via the program and data buses. This facili- 
tates single-cycle multiply/accumulates when used with repeat (RPT and 
RPTZ) instructions. In these instructions, the coefficient addresses are gener- 
ated by the PC, while the data addresses are generated by the ARAU. This 
allows the repeated instruction to sequentially access the values from the co- 
efficient table and step through the data in any of the indirect addressing 
modes. The RPTZ instruction also clears the accumulator and the product reg- 
ister to initialize the multiply/accumulate operation. As an example, consider 
multiplying the row of one matrix times the column of a second matrix. For this 
example, consider 10 x 10 matrices, MTRX1 points to the beginning of the 
first matrix, INDX = 10, and AR(ARP) points to the beginning of the second ma- 
trix: 

RPTZ #9 sFor i= 0, i< 10, i++ 

MAC MTRX1,*0+ ;PREG=DATA(MTRX1+i) x DATA[MTRX2 + (ixINDX) ] 


sACC += PREG. 
APAC sACC += PREG. 


The MAC and MACD instructions obtain their coefficient pointer from a long 
immediate address and are, therefore, two-word instructions. The MADS and 
MADD instructions obtain their coefficient pointer from the BMAR and are, 
therefore, one-word instructions. The use of the BMAR as a source to the coef- 
ficient table enables one block of code to support multiple applications and 
makes it unnecessary to modify executable code to change the long immedi- 
ate address. The MACD and MADD instructions also include a data move 
(DMOV) operation that, in conjunction with the fetch of the data multiplicand, 
writes the data value to the next-higher data address. 


The MACD and MADD instructions, when repeated, support filter constructs 
(weighted running averages) so that as the sum-of-products is executed, the 
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sample data is shifted in memory to make room for the next sample and to 
throw away the oldest sample. Circular addressing with MAC and MADS 
instructions may also be used to support filter implementation. 


For the example below, AR(ARP) points to the oldest of the samples. BMAR 
points to the coefficient table. In addition to initiating the repeat operation, the 
RPTZ instruction also clears the accumulator and the product register. In this 
example, the PC is stored in a temporary register while the repeated operation 
is executed. Next, the PC is loaded with the value stored in BMAR. The pro- 
gram bus is used to address the coefficients and, as the MADD is repeatedly 
executed, the PC increments to step through the coefficient table. The ARAU 
generates the address of the sample data. Indirect addressing with decrement 
steps the sample data, starting with the oldest data. As the data is fetched, it 
is also written to the next higher location in data memory. This operation aligns 
the data for the next execution of the filter by moving the oldest sample out past 
the end of the sample’s array and making room for the new sample at the be- 
ginning of the sample array. The previous product (PREG) is added to the ac- 
cumulator (ACC), while the two fetched values are multiplied and the product 
loaded into the PREG. Note that the DMOV portion of the MACD and MADD 
instructions will not function with external data memory addresses. 


RPTZ #9 sACC = PREG = 0. For I = 9 TO 0 Do 
MADD *— ;SUM Ay X Xz. Xrx41 = Xz.- 
APAC ;FINAL SUM. 


The MPYU instruction performs an unsigned multiplication, which greatly facil- 
itates extended-precision arithmetic operations. The unsigned contents of 
TREGO are multiplied by the unsigned contents of the addressed data memory 
location, with the result placed in PREG. This allows operands of greater than 
16 bits to be broken down into 16-bit words and processed separately to gener- 
ate products of greater than 32 bits. The SQRA (square/add) and SQRS 
(Square/subtract) instructions pass the same value to both inputs of the multi- 
plier for squaring a data memory value. 


After the multiplication of two 16-bit numbers, the 32-bit product is loaded into 
the 32-bit product register (PREG). The product from the PREG may be trans- 
ferred to the ALU or to data memory via the SPH (store product high) and SPL 
(store product low). 
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System control on the ’C5x is provided by the program counter, hardware 
stack, PC-related hardware, external reset signal, interrupts (see Section 3.8), 
status registers, and repeat counters. The following subsections describe the 
function of each of these components in system control and pipeline operation. 


Program Address Generation and Control 


The ’C5x has a 16-bit program counter (PC) and an eight-deep hardware stack 
for PC storage. The program counter addresses internal and external program 
memory in fetching instructions. The stack is used during interrupts and sub- 
routines. 


The program counter addresses program memory, either on-chip or off-chip, 
via the program address bus (PAB). Through the PAB, an instruction is ad- 
dressed in program memory and loaded into the instruction register (IREG). 
When the IREG is loaded, the PC is ready to start the next instruction fetch 
cycle. 


The PC can be loaded in a number of ways. When code is sequentially ex- 
ecuted, the PC is loaded with PC + 1. When a branch is executed, the PC is 
loaded with the long immediate value directly following the branch instruction. 
In the case of a subroutine call, the PC+2 is pushed onto the stack and then 
loaded with the long immediate value directly following the call instruction. The 
return instructions pop the stack back into the PC to return to the calling or in- 
terrupting sequence of code. In the case of a software trap or interrupt trap, 
the PC is loaded with the address of the appropriate trap vector. The contents 
of the accumulator may be loaded into the PC to implement computed GOTO 
operations. This can be accomplished with the BACC (branch to address in 
accumulator) or CALA (call subroutine at location specified by ACC) instruc- 
tions. 


The PAB bus can also address data stored in either program or data space. 
This makes it possible, in repeated instructions, to fetch a second operand in 
parallel with the data bus for two-operand operations. When repeated, the 
array addressed by the PAB is sequentially accessed via the incrementing of 
the PC. The block transfer instructions (BLDD, BLDP, and BLPD) use both 
buses so that, when repeated, the pipeline structure can be reading the next 
operand while writing the current one. The BLPD instruction loads the PC with 
either the long immediate address following the BLPD or with the contents of 
the block move address register (BMAR). The PAB bus is then used to fetch 
the source data from program space in this block move operation. The BLDP 
executes in the same way, except that the PAB bus is used for the destination 
operation. The BLDD instruction uses the PAB bus to address data space. 


The TBLR and TBLW instructions operate much like the BLPD and BLDP in- 
structions, respectively, except that the PC is loaded with the low 16 bits of the 
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accumulator instead of the BMAR or long immediate address. This facilitates 
calculated table look-up operations. The multiply/accumulate operations 
(MAC, MACD, MADD, and MADS) use the PAB bus to address their coefficient 
table. The MAC and MACD instructions load the PC with the long immediate 
address following the instruction. The MADD and MADS instructions load the 
PC with the contents of BMAR. 


To start a new fetch cycle, the PC is loaded either with PC +1 or with a branch 
address (for instructions such as branches, calls, and interrupts). In the case 
of conditional branches where the branch is not taken, the PC is incremented 
once more beyond the location of the branch immediate address. In addition 
to the conditional branches, the ’C5x has a full complement of conditional calls, 
executes, and returns. These instructions execute according to the following 
conditions: 


[Accumulator equaltozero = 
| 
Tat | Acc>0 | Accumulatorgreaterthan zero 

Accumulatorcary settoone | 


NOV V 


1 
0 | No accumulator overflow detected 


BIO is low BIO signal is low 


(tT | TC =1 | Test/control flag set to one — 


Unconditional operation 


Multiple conditions can be defined in the operands of the conditional instruc- 
tions. If multiple conditions are defined, all conditions must be met. For exam- 
ple, 


BCND BRANCH,LT,NOV :If ACC < 0 and no overflow. 


In this example, both conditions must be met (that is, OV = 0 and ACC < 0) for 
the branch to be taken. 


The conditional branch is a two-word instruction. The conditions for the branch 
are not stable until the fourth cycle of the branch instruction pipeline execution, 
because the previous instruction must have completely executed for the accu- 
mulator’s status bits to be accurate. Therefore, the pipeline controller stops the 
decode of instructions following the branch until the conditions are valid. If the 
conditions defined in the operands of the instruction are met, then the PC is 
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loaded with the second word and the core CPU starts refilling the pipeline with 
instructions at the branch address. Because the pipeline has been flushed, the 
branch instruction has an effective execution time of four cycles if the branch 
is taken. If, however, any of the conditions are not met, the pipeline controller 
allows the next instruction (already fetched) to be decoded. This means that 
if the branch is not taken, the effective execution time of the branch is two 
cycles. 


The subroutine call can also be executed conditionally. The CC instruction op- 
erates like the BCND except that the PC pointing to the instruction following 
the CC is pushed onto the PC stack. This sets up the return (by RET) to pop 
the stack to return to the calling sequence. A subroutine or function can have 
multiple return paths based upon the data being processed. Using conditional 
returns (RETC) avoids the need for conditionally branching around the return. 
For example, 


CC OVER_FLOW,OV ;If overflow,then execute the 
; ;overflow—handling routine. 


OVER_FLOW ;Overflow—handling routine. 
RETC GEQ If ACC >= 0, then return. 
RET ;Return. 


In the example, an overflow-handling subroutine is called if the main algorithm 
causes an overflow condition. During the subroutine, the ACC is checked and, 
if it is positive, the subroutine returns to the calling sequence. If not, additional 
processing is necessary before the return. Note that RETC, like RET, is a 
single-word instruction. However, because of the potential PC discontinuity, it 
still operates with the same effective execution time as BCND and CC. 


To avoid flushing the pipeline and causing extra cycles, the ’C5x has a full set 
of delayed branches, calls, and returns. In the delayed operation of branches, 
calls, or returns, the two-instruction words following the delayed instruction are 
executed while the instructions at and following the branch address are being 
fetched—therefore, giving an effective two-cycle branch instead of flushing 
the pipeline. If the instruction following the delayed branch is a two-word in- 
struction, only that instruction is executed before the branch is taken. For ex- 
ample, 

OPL #030h,PMST 

BCND NEW_ADRS,EQ 


or 


BCNDD NEW _ADRS,EQ 
OPL #030h,PMST. 
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The first code segment takes six cycles to execute (two for the OPL and four 
for the BCND). The second code segment takes four cycles because the two 
dead cycles following the BCNDD are filled with the OPL instruction. Note that 
the condition tested on the branch is not affected by the OPL instruction, thus, 
allowing it to be executed after the branch. 


In cases where the conditional branch is used to skip over one or two words 
of code, the branch can be replaced with the conditional execute instruction. 
For example, 


BCND SUM,NC 


ADD ONE 
SUM APAC 
or 
XC 1,C 
ADD ONE 
APAC 


The first code segment takes six cycles. The second code segment takes three 
cycles. If the condition is met in the second code segment, the ADD is ex- 
ecuted. If the condition is not met, then a NOP is forced in the instruction regis- 
ter over the ADD. Note that the condition must be stable one full cycle before 
the XC instruction is executed. This is to assure that the decision is made on 
the condition before the instruction following the XC is decoded (auxiliary reg- 
ister updates occur during the decode phase of an instruction, so the instruc- 
tion must be stopped before the decode to make sure it is not executed). The 
user should avoid affecting the XC test conditions one instruction word before 
the XC. Without interrupts, this instruction will have no effect on the XC. How- 
ever, with interrupts, an interrupt can trap between the instruction and the XC 
so that the condition is affected prior to the XC execution. The following exam- 
ples show this cycle dependency: 


LACL #0 sACC = 0. 

ADD ‘TEMP1 2ACC = TEMP1. 

XC 2,EQ sIf ACC == 0, 

SPLK #0EEEEh, TEMP2 Then TEMP2 = EEEE. 
or 

LACL #0 sACC = 0. 

ADD #01234h sACC = 00001234. 
XC 2,EQ sIf ACC == 0, 


SPLK #0QEEEEh, TEMP2 *Then TEMP2 is unmodified. 


In the first code segment, TEMP2 = EEEE. The NEQ status, caused by the 
ADD instruction, is not established at the time the decision is made by the XC 
instruction. Therefore, the previous condition of EQ, caused by the ZAC in- 
struction, determines the conditional execute. Since this condition is met, 
TEMP2 is loaded by the SPLK instruction. Note that interrupts can trap before 
the XC and after the ADD so that the SPLK will not execute. In the second code 
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segment, TEMP2 is not set to EEEE. The NEQ status, caused by the ADD in- 
struction, is established one full cycle before the XC execution phase because 
the long immediate value (#01234h) used in the ADD caused it to be a 
two-cycle instruction. Since the condition is not met, a NOP is forced over both 
words of the two-word SPLK instruction, and, therefore, TEMP2 is not af- 
fected. Note that interrupts have no effect on this instruction sequence. 


The ’C5x also has a feature that allows the execution of a single instruction N 
+ 1 times where N is the value loaded in a 16-bit repeat counter (RPTC). If the 
repeat feature is used, the instruction is executed and the RPTC is decrem- 
ented until the RPTC goes to zero. This feature is useful with many instruc- 
tions, such as NORM (normalize contents of accumulator), MACD (multiply 
and accumulate with data move), and SUBC (conditional subtract). As instruc- 
tions repeat, the program address and data buses are freed to fetch a second 
operand in parallel with the data address and data buses. This allows instruc- 
tions such as MACD and BLPD to effectively execute in a single cycle when 
they repeat. See Section 7.6, Single Instruction Repeat Loops, for details on 
these instructions. 


The stack is 16 bits wide and eight levels deep. The PC stack is accessible 
through the use of the PUSH and POP instructions. Whenever the contents 
ofthe PC are pushed onto the top of the stack, the previous contents of each 
level are pushed down, and the bottom (eighth) location of the stack is lost. 
Therefore, data will be lost if more than eight successive pushes occur before 
a pop. The reverse happens on pop operations. Any pop after seven sequen- 
tial pops yields the value at the bottom stack level, and all of the stack levels 
then contain the same value. Two additional instructions, PSHD and POPD, 
push a data memory value onto the stack or pop a value from the stack to data 
memory. These instructions allow a stack to be built in data memory for the 
nesting of subroutines/interrupts beyond eight levels. See Section 7.3, Soft- 
ware Stack, for details on software stack. 


3.6.2 Pipeline Operation 
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Instruction pipelining consists of the sequence of bus operations that occur 
during instruction execution. In the operation of the pipeline, the instruction 
fetch, decode, operand fetch, and execute operations are independent, which 
allows overall instruction executions to overlap. Thus, during any given cycle, 
one to four different instructions can be active, each at a different stage of com- 
pletion, resulting in a four-deep pipeline. Figure 3—14 shows the operation of 
the four-level pipeline for single-word single-cycle instructions executing with 
no wait states. The pipeline is essentially invisible to the user except in some 
cases, such as auxiliary register updates, memory-mapped accesses of the 
CPU registers, the NORM instruction, and memory configuration commands. 
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Figure 3-14. Four-Level Pipeline Operation 
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ARAU updates of auxiliary registers execute during the decode (second 
phase) of the pipeline. This allows the address to be generated before the op- 
erand fetch phase. However, memory-mapped accesses (e.g., SAMM, 
LMMR, SACL, or SPLK) of these registers happen on the execute phase of 
the pipeline. This means that the next two instructions after a memory-mapped 
load of the auxiliary register should not use this auxiliary register. In addition, 
modifications to the memory-mapped registers INDX and ARCR also occur in 
the execution phase of the pipeline. Therefore, any auxiliary register updates 
using the INDX register or auxiliary register compares using the ARCR register 
must occur at least two cycles after a load of these registers. The following 
code examples illustrate the effects of a memory-mapped write to an auxiliary 
register: 


EXAM1 LARAR2,#067h ;AR2 = 67. 
LACC #064h ;ACC = 00000064. 
SAMM AR2 ;This update is overridden by *— up— 
:;dates on the next two instructions. 
LACC *— ;AR2 = 66. 
ADD *— ;AR2 = 65. 
or 
EXAM2 LARAR2,#067h ;AR2 = 67. 
LACC #064h sACC = 00000064. 
SAMM AR2 ;LACC *— update happens before 
*SAMM write. 
LACC ‘— 7;AR2 = 66. 
NOP ;AR2 = 64 {SAMM write to AR2 happens 
;in parallel with the NOP. 
ADD *— ;AR2 = 63. 
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or 

EXAM3 LAR AR2,#067h ;AR2 = 67. 
LACC #064h ;ACC = 00000064. 
SAMM AR2 *AR2 = 64. 
NOP ;Pipeline protection. 
NOP ;Pipeline protection. 
LACC *— ;AR2 = 63. 
ADD *— ;AR2 = 62. 


In EXAM1, the decode phase of the ADD instruction is on the same cycle as 
the execute (write) phase of the SAMM instruction. Both of these instructions 
are trying to load AR2. The ADD *— update does load AR2, while the SAMM 
execution is voided. In EXAM2, a NOP is strategically placed to avoid the con- 
flict between the ADD *— update of the AR2 and the SAMM write to AR2. In this 
code’s sequence: 


AR2= 67 —~> 66 —~> 64 ~ 683 


Note that the LACC address is based on the value in AR2 before the SAMM 
write to AR2. in EXAM3, the SAMM write to AR2 is completed before either the 
LACC or the ADD have updated AR2. Any two instruction words that do not 
update AR2 can be used in place of the two NOP instructions. This could be 
two one-word instructions or one two-word instruction. The results obtained 
by EXAM1 and EXAM2 code examples may be different if the code is interrupt- 
ible. The user should avoid writing code similar to EXAM1 and EXAM2. 


The pipeline effect described above requires writes to memory-mapped regis- 
ters to allow for a latency between the write and an access of that register. 
These registers can be accessed by ’C5x instructions in the decode and oper- 
and fetch phases of the pipeline. Table 3—6 outlines the latency required be- 
tween an instruction that writes the register via its memory-mapped address, 
and the access of that register by subsequent instructions. Note that all direct 
accesses to the registers that do not use memory-mapped addressing (such 
as all ’C25-compatible instructions, like LAR, LT, etc) are pipelined-protected 
and, hence, do not require any latency. 


Architecture 


Table 3-6. Latencies Required 
[Name Description 


[Next 2words use previousmap =| 
[Next Tworduses oldbitaddress 
word update gets over written 


| 2 | Next 2 words use previous value 


[GREG | Global memory allocation register 


Auxiliary registers 0-7 
INDX 
ARCR Auxiliary register compare register 


i es 
oBSR | Greularbufer star registers and? 
CCBER |" reular bur ond registers Tand@ [2 
Circular buffer control register 
: 
"POWER | Progranfdata SW wal sat egier | 1 | Newt word uses previous count 


CWSR SMW wait state control register | 1 | Next 1 word uses previous modes 
Configuration bit in ST1 register | 2 | Next 2 words use previous map 


The NORM instruction affects AR(ARP) during its execute phase of the pipe- 
line. The same pipeline management, as described above, works in this case. 
The assembler can detect an auxiliary register update or store (SAR) directly 
after a NORM instruction and insert NOP instructions automatically to main- 
tain source-code compatibility with the C25 (—p option). 


The ’C5x core CPU supports the reconfiguration of memory segments, both 
internal and external to the device. The reconfiguration operations happen 
during the execute phase of the pipeline. Therefore, before an instruction uses 
the new configuration, at least two instruction words should follow the instruc- 
tion that reconfigures memory. In the following example, assume AR(ARP) = 
0200h and RAMBO(0) = 1. 


CLRC CNF ;Map RAM BO to data space. 
LACC #01234h ;ACC = 00001234. 
ADD * ;AcC = 00001235. 


Notice the use of the LACC #01234h to fill the two-word requirement. Because 
along immediate operand is used, this is atwo-word instruction and, therefore, 
meets the requirement. This also applies to memory configurations controlled 
by the PMST register. 
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3.6.3 Status and Control Registers 
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There are four key status and control registers for the 'C5x core. STO and ST1 
contain the status of various conditions and modes compatible with the ’C25, 
while PMST and CBCR contain extra status and control information for control 
of the enhanced features of the ’C5x core. These registers can be stored into 
data memory and loaded from data memory, thus allowing the status of the 
machine to be saved and restored for subroutines. STO, ST1, and PMST each 
have an associated one-deep stack for automatic context-saving when an in- 
terrupt trap is taken. The stack is automatically popped upon a return from in- 
terrupt (RETI or RETE). Note that the XF bit in ST1 is not saved on the one- 
deep stack or restored from that stack on an automatic context save. This fea- 
ture allows the XF pin to be toggled in an interrupt service routine while still 
allowing automatic context saves. 


The PMST and CBCR registers reside in the memory-mapped register space 
in page zero of data memory space. Therefore, they can be acted upon directly 
by the CALU and the PLU. They can be saved in the same way as any other 
data memory location. Note that the CALU and the PLU operations change the 
bits of these status registers during the execute phase of the pipeline. The next 
two instruction words, following an update of these status registers, may not 
be affected by the reconfiguration caused by the status update, as shown in 
Table 3-6. 


The LST instruction writes to STO and ST1, and the SST instruction reads from 
them, except that the INTM bit is not affected by the LST instruction. Unlike the 
PMST and CBCR registers, the STO and ST1 registers do not reside in the 
memory map and, therefore, cannot be handled by using the PLU instructions. 
The individual bits of these registers can be set or cleared with the SETC and 
CLRC instructions. For example, the sign-extension mode is set with SETC 
SXM or cleared with CLRC SXM. 


Figure 3-15 shows the organization of the four status registers, indicating all 
status bits contained in each. Several bits in the status registers are reserved 
and read as logic ones. Table 3—7 defines all the status/control bits. 
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Figure 3-15. Status and Control Register Organization 


15 14 13 12 11 10 9 8 7 6 § 4 3 2 ji Q 


oO. 
> 
STO cr 5 


=\-|E 
6 |" |Z 


15 1413 12 11:10 9 8 7 6 § 4 3 2 1 0 


ST 
15 14 13 12 1110 9 8 7 6 5 4 3 2 0 
N nu 
CBCR >| « 
5 O 
O 
(151413121110 9 8 7 6 5 4 3 0 
> 
PMST S 
| fo) 


Table 3—7. Status Register Field Definitions 


Auxiliary Register Pointer Buffer. Whenever the ARP is loaded, the old ARP value is copied to the ARB 
except during an LST instruction. When the ARB is loaded via an LST #1 instruction, the same value 
is also copied to the ARP. This is useful when restoring context (when not using the automatic context | 
save) in a subroutine that modifies the current ARP. 


Auxiliary Register Pointer. This three-bit field selects the AR to be used in indirect addressing. When 
the ARP is loaded, the old ARP value is copied to the ARB register. ARP may be modified by memory- | 
reference instructions when indirect addressing is used, and by the MAR and LST instructions. The 
ARP is also loaded with the same value as ARB when an LST #1 instruction is executed. 


Address VISibility Mode. This mode allows the internal program address to appear at the pins of the 
device so that the internal program address can be traced and the interrupt vector can be decoded 
in conjunction with [ACK when the interrupt vectors reside in on-chip memory. The internal program 
address is driven to the pins when AVIS = 0. The address lines do not change with internal program 
when AVIS = 1. Note that the control lines and data lines are not effected when AVIS = 0 and the ad- 
dress bus is driven with the last address on the bus. The AVIS bit is set to zero at reset. 


Block Repeat Active Flag. This bit indicates whether block repeat is currently active. Writing a zero 
to this bit deactivates block repeat. BRAF is set to zero upon reset. 


Carry Bit. This bit is set to 1 if the result of an addition generates a carry, or is reset to 0 if the result 
of a subtraction generates a borrow. Otherwise, it is reset after an addition or is set after a subtraction, | 
unless the instruction is ADD or SUB with a 16-bit shift. In these cases, the ADD can only set and the 
SUB only reset the carry bit, but they cannot affect it otherwise. The single-bit shift and rotate instruc- | 

tions, as well as the SETC, CLRC, and LST #1 instructions also affect this bit. C is set to 1 on a reset. 
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Table 3—7. Status Register Field Definitions (Continued) 


Circular Buffer 1 Auxiliary Register. These three bits identify which auxiliary register is assigned to circu- 
lar buffer 1. 


Circular Buffer 2 Auxiliary Register. These three bits identify which auxiliary register is assigned to circu- 
lar buffer 2. 


Circular Buffer 1 Enable. This bit, when set to 1, enables circular buffer 1. When CENB1 is set to 0, circu- 
lar buffer 1 is disabled. CENB1 is set to zero upon reset. 


CENB1 


CENB2 Circular Buffer 2 Enable. This bit, when set to 1, enables circular buffer 2. When CENB2 is set to 0, circu- 


lar buffer 2 is disabled. CENB2 is set to zero upon reset. 


CNF On-chip RAM Configuration Control Bit. If this bit is set to 0, the reconfigurabie-data dual-access RAM 
blocks are mapped to data space; otherwise, they are mapped to program space. The CNF may be modi- 
fied by the SETC CNF, CLRC CNF, and LST #1 instructions. RS sets the CNF to 0. 


Data Memory Page Pointer. The 9-bit DP register is concatenated with the 7 LSBs of an instruction word 
to form a direct memory address of 16 bits. DP may be modified by the LST and LDP instructions. 


= 


Hold Mode Bit. When HM = 1, the processor halts internal execution when acknowledging an active 
HOLD. When HM = 0, the processor may continue execution out of internal program memory but puts 
its external interface in a high-impedance state. This bit is set to 1 by reset. 


INTM Interrupt Mode Bit. When this bit is set to 0, all unmasked interrupts are enabled. When it is set to 1, all 
maskable interrupts are disabled. INTM is set and is reset by the SETC INTM and CLRC INTM instruc- 
tions. RS and TACK also set INTM. INTM has no effect on the unmaskabie RS and NMI interrupts. Note 
that INTM is unaffected by the LST instruction. This bit is set to 1 by reset. Itis also set to 1 when a mask- 
able interrupt trap is taken. It is reset to 0 when a RETE (return from interrupt with interrupt enable) is | 


executed. 


IPTR Interrupt Vector Pointer. These five bits point to the 2K page where the interrupt vectors reside. This al- 
lows you to remap the interrupt vectors to RAM for boot-lioaded operations. At reset, these bits are all 


set to zero. Therefore, the reset vector always resides at zero in the program memory space. 


Microprocessor/Microcomputer Bit. When this bit is set to zero, the on-chip ROM is enabled. When it is 
set to one, the on-chip ROM is not addressable. This bit is set to the value corresponding to the logic 
level on the MP/MC pin at reset. The level on the MP/MC pin is sampled at device reset only and can 
have no effect until the next reset. 


NDX Enable Extra Index Register. This bit configures indexed indirect addressing and auxiliary address regis- 
ter compare to operate either in a ’C2x-compatible mode (NDX = 0) or in a’C5x-enhanced mode (NDX 
= 1). When NDX = 0, any ’C2x-compatible instruction that modifies or loads ARO, also modifies/loads 
the INDX and ARCR registers in addition to ARO. This is because the ’C2x devices use ARO for indexing 
and AR compare operations. When NDX = 1, INDX and ARCR are not affected by any 'C2x-compatible 


instruction. NDX = 0 at reset. 


O 
< 


Overflow Flag Bit. As a latched overflow signal, OV is set to 1 when overflow occurs in the ALU. Once 
an overflow occurs, the OV remains set until a reset, BCND(D) on OV/NOV, or LST instruction clears 
OV. 


OVLY RAM Overlay Bit. This bit enables on-chip single-access program RAM cells to be mapped into data 
space. If OVLY is set to one, the block of memory is mapped into data space. If it is set to 0, the memory 
block is not addressable in data space. See Table 3-8 for the mappings of specific 'C5x devices. This 


bit is set to zero at reset. 
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Table 3-7. Status Register Field Definitions (Concluded) 


Overflow Mode Bit. When OVNM is set to 0, overflowed results overflow normally in the accumulator. 
When set to 1, the accumulator is set to either its most positive or most negative value upon encountering 
an overflow. The SETC and CLRC instructions set and reset this bit, respectively. LST may also be used 
to modify the OVM. 


Product Shift Mode. If these two bits are 00, the multiplier’s 32-bit product is not shifted when transferred 
to the ALU. If PM = 01, the PREG output is left-shifted one place when transferred to the ALU, with the 
LSB zero-filled. If PM = 10, the PREG output is left-shifted by four bits when transferred to the ALU, with | 
the LSBs zero-filled. PM = 11 produces a right shift of six bits, sign-extended. Note that the PREG con- 
tents remain unchanged. The shift also takes place when the contents of the PREG are stored to the 
data memory. PM is loaded by the SPM and LST #1 instructions. The PM bits are cleared by RS. 


Program RAM Enable. This bit enables mapping of on-chip single-access RAM blocks into program 
space. RAM set to 1 maps the memory block in program space. RAM set to 0 removes the memory block 
from the program space. See Table 3-8 for the mappings of specific 'C5x devices. This bit is set to zero 
at reset. 


Sign-Extension Mode Bit. SXM = 1 produces sign extension on data as it is passed into the accumulator 
through the scaling shifter. SXM = 0 suppresses sign extension. SXM does not affect the definitions of 
certain instructions; e.g., the ADDS instruction suppresses sign extension, regardless of SXM. This bit 
is set by the SETC SXM, reset by the CLRC SXM instructions, and may be loaded by the LST #1. SXM 
is set to 1 by reset. 


Test/Control Flag Bit. The TC bit is affected by the BIT, BITT, CMPR, LST #1, NORM, CPL, XPL, OPL, 
and APL instructions. The TC bitis set to a 1 if (1) a bit tested by BIT or BITT is a1, (2) acompare condition 
tested by CMPR exists between ARCR and another AR pointed to by ARP, (3) the exclusive-OR function 
of the two MSBs of the accumulator is true when tested by a NORM instruction, (4) the long immediate 
value is equal to the data value on the CPL instruction, or (5) the result of the logical function (KPL, OPL 
or APL) is zero. The TC bit can influence the execution of the conditional branch, call, and return instruc- | 
tions. 


Enable Multiple TREGs. This bit sets the 'C5x to operate in either 'C2x-compatible mode (TRM = 0) or 
‘C5x-enhanced mode (TRM=1) in conjunction with the use of the TREGO, TREG1, and TREG2 registers. 
This bit affects the operation of all ’C2x-compatible instructions that modify TREGO. The ’C2x CPU uses 
TREGx as a shift count for the prescaling shifter and as a bit address in the BITT instruction. When 
TRM=0, all ’C2x-compatible instructions write to all three of the 'C5x TREGs to maintain source compati- 
bility with the 'C2x devices. When TRM = 1, the LT instructions affect only TREGO. TRM = 0 upon reset. 


XF Pin Status Bit. This bit indicates the state of the XF pin, a general-purpose output pin. XF is set by 
the SETC XF and reset by the CLRC XF instructions. XF is set to 1 by reset. This bit is not saved or re- 
stored on an automatic context save during interrupt service routines. 


Table 3-8. On-Chip Single-Access RAM Configuration Control 


fF oO 86 Mapped into program space | 
PF ot Mapped into data space | _ 


OVLY RAM On-Chip SARAM Configuration | | 
a rr ae Mapped into both program and data spaces | 
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3.6.4 Repeat Counter 


RPTC is a 16-bit repeat counter, which, when loaded with a number N, causes 
the next single instruction to be executed N + 1 times. The RPTC register is 
loaded by either the RPT or the RPTZ instruction. This results in a maximum 
of 65,536 executions of a given instruction. RPTC is cleared by reset. The 
RPTZ instruction clears both ACC and PREG before the next instruction starts 
repeating. Once a repeat instruction (RPT or RPTZ) is decoded, all interrupts 
including NMI (except reset) are masked until the completion of the repeat 
loop. However, the device responds to the HOLD signal while executing an 
RPT/RPTZ loop. The RPTC register resides in the CPU’s memory-mapped 
register space; however, you should avoid writing to this register. 


The repeat function can be used with instructions such as multiply/accumu- 
lates (MAC and MACD), block moves (BLDD and BLPD), !/O transfers (IN/ 
OUT), and table read/writes (TBLR/TBLW). These instructions, although nor- 
mally multicycle, are pipelined when the repeat feature is used, and they effec- 
tively become single-cycle instructions. For example, the table read instruction 
may take three or more cycles to execute, but when the instruction is repeated, 
a table location can be read every cycle. Note that not all instructions can be 
repeated. Table 3-9 through Table 3-11 list all 'C5x instructions, according to 
their repeatability. 


Table 3-9. mepeatanie inst UCvOnS 


i Add to ACC aioe malted with lth 
BLDD Block move from data memory to data memory 


BSAR Barrel-shift ACC right 

| DMOV Move direct/indirect addressed data one location up in memory | 

a 
a Load memory-mapped register | 

Load TREGO direct/indirect and add PREG to ACC 

Load TREGO direct/indirect with data move and add PREG to ACC 
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Table 3-9. Repeatable Instructions (Continued) 


Repeatable Instructions Description 

TS Load TREGO direct/indirect and subtract PREG 

MAC Add PREG to ACC and multiply immediate addressed by direct/indirect 
MACD Add PREG to ACC and multiply immediate addressed by direct/indirect with data move 
MADD Add PREG to ACC and multiply BMAR addressed by direct/indirect with data move 
MADS Add PREG to ACC and multiply addressed by direct/indirect | 


MAR Modify AR 

OPL OR DBMR to direct/indirect addressed 
OUT Write to I/O space 

POP Pop the PC stack to low ACC 

ROL Rotate ACC left once 


ROLB Rotate combined ACC and ACCEB left once 
Rotate ACC right once 
| RORB Rotate combined ACC and ACCB right once 


DD 
J 


SAR AR,* Store AR indirect addressed 

SBB Subtract ACCB from ACC 

SFR Shift ACC right once 

SPH Store high PREG to direct/indirect addressed 
SPL Store low PREG to direct/indirect addressed 
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Table 3—9. Repeatable Instructions (Concluded) 


Repeatabie Instructions 


ecceateacanieer _ 

SRS Subtract PREG from AGC and equare drecvindrect addressed ————SOS~SCS 

TSuBB Subtract rom ACC diretindrect wih borow ———SSSS—S~—SCS 
! 


TBLW Write from data space to program space 


XPL XOR DBMR to direct/indirect addressed 


Table 3—10. Instructions Not Meaningful to Repeat 


| instructions Not Meaningful to Repeat Description | 
| ABS Absolute value of ACC 
AND AND to low ACC direct/indirect 
L 


ANDB AND ACCB to ACC 
Test bit in data word } 


CLRC Clear status bit 


Towpr SSS Compare AR ARPA SSSCSSCSC~S 
Tcrat_——~SSSS Copa AC to ACC and match larger valuo 
Portt—~—~SSSS Compare AC to ACR and match smaler vale 
Piacs SSCS a Aw ASS 
LACC = dma,shft Load ACC direct addressed with shift | 
| LACC *shft Load ACC indirect addressed with shift | 
LACL 


B | : 

5 mr 

BIT a : 
BITT | | __ Test bit (specified by TREG2) in data word : 
CMPL . “Complement ACC | 

. - Serer 

XA ith ACC 


Load low ACC direct/indirect and zero high ACC 


| LAR * AR Load AR indirect addressed | 
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Table 3-10. Instructions Not Meaningful to Repeat (Continued) 


Instructions Not Meaningful to Repeat Description | 


L Load high PREG with direct/indirect addressed 
Load status registers | 
L Load TREGO with direct/indirect addressed 


MPYU Multiply TREGO by direct/indirect unsigned 


7 | 
T 
LT Load TREGO direct/indirect and load ACC with PREG 
M 
N Negate ACC | | | 
> , 


=v ——n 
ST | 
— 
P | 7 
CMpy SSCS TREC By rectindrect SCS 
° ° . 
EG 
R indi 
RB 
Cc 


O OR to low ACC direct/indirect 
O OR ACCB to ACC 
A Load ACC with PREG | 


SACB Store ACC in ACCB 
SAR AR,dma Store AR direct addressed 7 


SETC | Set status bit | | : 
SPM Set PREG shift mode | | 
XOR XOR to low ACC direct/indirect | | 


zap aro tend REG 
| ZPRO ero PREG 


Table 3-11. Nonrepeatable Instructions 


Nonrepeatable Instructions | Description | | | | 
A # Add to ACC short immediate 
# 


DD i 
[AOD Akshih Aa o ACC ong immediate wR 
[ARO Aksih AND wAGC ong nmediatewinan ——SSC~=*dY 

APL i 


BANZ[D] Branch [delayed] on AR(ARP) not zero 
BCND[D] Branch [delayed] conditionally | 
CALAID] | Call [delayed] to address specified in low ACC | 


CC[D] Call [delayed] subroutine conditionally | 


CPL sl Compare long immediate to direct/indirect addressed _ | 


k 
Ik AND long immediate to direct/indirect addressed 
k 
IDLE | IdleCPU | 


BACC[D] Branch [delayed] to address specified in low ACC | 


CALLID] Call [delayed] subroutine | _ 


3-45 


System Control 


Table 3-11 .Nonrepeatable Instructions (Continued) 


Nonrepeatible Instructions Description 


IDLE2 idle until interrupt — low power mode 
INTR Soft interrupt 
LACC  #Ik, shit Load ACC long immediate 


LAC #k Load ACC short immediate 
LAR #lk Load AR with long immediate 
LD #k Load DP short immediate 
NM Non-maskable interrupt 
P Ik OR long immediate to direct/indirect addressed 
R , i 
E 
P 
Ik 
k 
1k | 


L 

Pp 

I | 

OPL #, irect/indi 

PRET Rat rom tariptsenicerouine 

RE Saar ripe 
R . ° ° 


T 
XC Execute next instruction conditionally 
XO #ik, shit XOR to ACC long immediate with shift 


XPL 


SBRK Subtract from AR short immediate 


# XOR long immediate to direct/indirect addressed 


3.6.5 Block Repeat 


The block repeat feature provides zero-overhead looping for implementation 
of FOR and DO loops. The function is controlled by three registers (PASR, 
PAER, and BRCR) and the BRAF bit in the PMST register. The block repeat 
counter register (BRCR) is loaded with a loop count of 0 to 65,535. Then, ex- 
ecution of the RPTB (repeat block) instruction loads the program address start 
register (PASR) with the address of the instruction following the RPTB instruc- 
tion and loads the program address end register (PAER) with its long immedi- 
ate operand. The long immediate operand is the address of the instruction fol- 
lowing the last instruction in the loop minus one. Note that the repeat block 
must contain at least three instruction words. Execution of the RPTB instruc- 
tion automatically sets active the BRAF bit. With each PC update, the PAER 
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is compared to the PC. If they are equal, the BRCR contents are compared to 
zero. If the BRCR is greater than zero, it is decremented, and the PASR is 
loaded into the PC, thus starting the loop over. If not, the BRAF bit is set low, 
and the processor resumes execution past the end of the code’s loop. For ex- 
ample, 


SPLK #010h,BRCR. ;Set loop count to 16. 
RPTB END_LOOP—1 *For I = BRCR; I >=0; I-—. 


ZAP sACC = PREG = 0. 
SQRA *,AR2 ;PREG = X2. 
SPL SQRX *Save X2. 
MPY * sPREG = b x X. 
LTA SQRX sACC = bX. TREG = X2. 
MPY * ;PREG = aX?2., 
APAC *ACC = aX2 + bX. 
ADD *,0,AR3 eACC = aX2 + bX + c = Y. 
SACL *,0,AR1 sSave Y. 
CRGT *Save MAX. 
END _ LOOP 


The example implements 16 executions of Y = aX* + bX + c and saves the 
maximum value in ACCB. Note that the initialization of the auxiliary registers 
is not shown in the coded example. PAER is loaded with the address of the last 
word in the code segment. The label END_LOOP is placed after the last in- 
struction, and the RPTB instruction long immediate is defined as 
END_LOOP-1 in case the last word in the loop is a two-word instruction. 


There is only one set of block repeat registers, so multiple block repeats cannot 
be nested without saving the context of the outside block or using BANZD. The 
simplest method of executing nested loops is to use the RPTB for only the in- 
nermost loop and using BANZD for all the outer loops. This is still a valuable 
cycle-saving operation because the innermost loop is repeated significantly 
more times than the outer loops. Block repeats can be nested by storing the 
context of the outer loop before initiating the inner loop, then restoring the outer 
loop’s context after completing the inner loop. The context save and restore 
are shown in the following example: 


SMMR - BRCR,TEMP1 ;Save block repeat counter 
SMMR PASR,TEMP2 :;Save block start address 
SMMR #£PAER, TEMP3 Save block end address 


SPLK #NUM_LOOP,BRCR ;Set inner loop count 


RPTB END_INNER *For I = 0; I<=BRCR; I++ 
END_INNER 

OPL #1,PMST ;Set BRAF to continue outer loop 

LMMR BRCR,TEMP1 ;Restore block repeat counter 

LMMR PASR, TEMP2 ;Restore block start address 

LMMR £PAER, TEMP3 ;Restore block end address . 
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In this example, the context save and restore operations take 14 cycles. Note 
that repeated single and BANZ/BANZD loops can also be inside a block re- 
peat. The repeated code can include subroutine calls. Upon returning, the 
block repeat resumes. Repeated blocks can be interrupted. When an enabled 
interrupt occurs during a repeated block of code, the CALU traps to the inter- 
rupt and, when the ISR returns, the block repeat resumes. 


Be extremely careful when interrupting block repeats. If the interrupt service 
routine uses block repeats, check whether a block repeat has been interrupted 
and, if so, save the context of the block repeat as shown in the previous exam- 
ple. Smaller external loops can be implemented with the BANZD-looping 
method that takes two extra cycles per loop (thatis, ifthe loop countis less than 
8, it may be more efficient to use the BANZD technique). Single-cycle instruc- 
tions can be repeated within a block repeat by using the RPT or RPTZ instruc- 
tions. 


While a block is being repeated, the block repeat active flag (BRAF) of the 
PMST register is set to a one. This flag is set by the execution of the RPTB 
instruction and is reset when the PC = PAER and BRCR = 0. This flag can be 
cleared and/or reset via the PMST register. WHILE loops can be implemented 
with the RPTB instruction and aconditional reset of the BRAF bit. The following 
code example clears BRAF so that the processor will drop out of the code loop 
and continue to sequentially access instructions past the end of the loop if an 
overflow occurs: 


XC 2,0V sIf overflow, 
APL #OFFFEh,PMST ;then turn off block repeat. 


The equivalent of a WHILE loop can be implemented by setting the BRAF bit 
to zero if the exit condition is met. If this is done, the program completes the 
current pass through the loop but does not go back to the top. To exit, the bit 
must be reset at least four instruction words before the end of the loop. You 
can exit block repeat loops and return to them without stopping and restarting 
the loop. Branches, calls, and interrupts do not necessarily affect the loop. 
When program control is returned to the loop, loop execution is resumed. The 
following example illustrates the block repeat with a small loop of code that ex- 
ecutes a series of tasks. The tasks are stored in a table addressed by TEMPOF. 
The number of tasks to be executed is defined at NUM_TASKS. 


BLKP NUM_TASKS,BRCR ;Set loop count. 
SPLK #(TASKS—1),TEMPOF ;TEMPOF points to list of tasks. 
RPTB ENDCALL—1 ;For I = 0, I <= NUM_TASKS; I++. 
TASK_HANDLER 
LACC TEMPOF ;ACC points to task table. 
ADD #1 ;Increment pointer to next task. 
SACL TEMPOF ;Save for next pass of loop. 
TBLR TEMPOE ;Get task address. 
LACC TEMPOE ;ACC = task address. 
CALA ;Call task. 
ENDCALL 
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In the setup for the example, the block repeat counter (BRCR) is loaded with 
the number of tasks to be executed, minus 1. Next, the address of the task 
table is loaded into a temporary register. The block repeat is started with the 
execution of the RPTB instruction. The PASR register is loaded with the ad- 
dress of the LACC TEMPOF instruction. The PAER register is loaded with the 
address of the last word of the table. Notice that the label marking the end of 
the loop is placed after the last instruction, then the PAER is loaded with that 
label, minus 1. Itis possible to place the label before the CALA instruction, then 
load the PAER with the label address because this is a one-word instruction. 
However, if the last instruction in this loop had been a two-word instruction, the 
second word of the instruction would not be read, and the long immediate oper- 
and would be substituted with the first instruction in the loop. 


Inside the loop, the pointer to the task table is incremented and saved. Then, 
the task address is read from the table and loaded into the accumulator. Next, 
the task is called by the CALA instruction. Notice that, when the task returns 
to the task handler, it returns to the top of the loop. This is because the PC has 
already been loaded with the PASR before the CALA executes the PC discon- 
tinuity. Therefore, when the CALA is executed, the address of the top of the 
loop is pushed onto the PC stack. 


The last two words of a repeat-block loop are not interruptible. In other words, 
the interrupt path will not be taken while the last two instruction words of a re- 
peat block are being fetched. 


Example 3—1.Interrupt Operation With a Single-Word Instruction at the End of an RPTB 


RPTB END_LOOP—1 
SAR ARO, * <— interrupt path taken here 
if not the last loop iteration 


LACC *+ 
SACL * <— interrupt occurs here 

ENDLOOP: 
MAR *, AR1 < Interrupt path taken here if interrupt 


occurs during last two instruction words 
of the last loop iteration 


Example 3—2.Interrupt Operation With a Two-Word Instruction at the End of an RPTB 


RPTB END LOOP-1 
SAR ARO,* <— interrupt path taken here 
if not the last loop iteration 


LACC *+ 


SPLK #1234h,* <— interrupt occurs here 
ENDLOOP : 
MAR *, ARI <— Interrupt path taken here if interrupt 


occurs during last two instruction words 
of the last loop iteration 
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Note that any incoming interrupt is latched by the 'C5x as soon as it meets the 
interrupt timing requirement. However, the PC does not branch to the corre- 
sponding interrupt service routine vector if it is fetching the last two words of 
a repeat-block loop. This behavior is functionally equivalent to disabling inter- 
rupts before fetching the last two instruction words, and re-enabling interrupts 
afterward. Interrupt operation with repeat blocks potentially increases the 
worst-case interrupt latency time. 


3.6.6 Power-Down Mode 
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In the power-down mode, the ’'C5x core enters a dormant state and dissipates 
considerably less power than normal. Power-down mode is invoked either by 
executing the IDLE/ IDLE2 instructions or by driving the HOLD input low with 
the HM status bit set to one. 


While the ’C5x is in power-down mode, all its internal contents are maintained; 
this allows operation to continue unaltered when power-down mode is termi- 
nated. All CPU activities are halted when the IDLE instruction is executed but 
the CLKOUT1 pin remains active. The peripheral circuits continue to operate, 
allowing the peripherals such as serial ports and timers to take the CPU out 
of its powered-down state. Power-down mode, when initiated by an IDLE in- 
struction, is terminated upon receipt of an interrupt. If INTM = 0, then the pro- 
cessor enters the interrupt service routine when IDLE is terminated. If INTM 
= 1, then the processor continues with the instruction following IDLE. 


The IDLE2 instruction is used for a complete shutdown of the core CPU as well 
as all on-chip peripherals. Because the on-chip peripherals are stopped with 
this power-down mode, they cannot be used to generate the interrupt to wake 
the device as described above on the IDLE mode. However, the power is sig- 
nificantly reduced because the complete device is stopped. This power-down 
mode is terminated by activating any of the external interrupt pins (RS, NMI, 
INTT, INT2, INTS3, and INT4) for at least five machine cycles. Once again, if 
INTM = 0, then the processor enters the interrupt service routine when the 
IDLE2 instruction is terminated. If INTM =1, then the processor continues with 
the instruction following the IDLE2. It is advisable to reset peripherals when 
IDLE2 terminates execution, especially if they are externally clocked. 


Power-down mode can also be initiated by the HOLD signal. When the HOLD 
signal initiates power-down and HM=1, the CPU stops executing; also, ad- 
dress and control lines go into high impedance for further power reduction. If 
HM=0 when HOLD initiates power-down, address and memory control signal 
drivers still go into high impedance, but the CPU continues to execute internal- 
ly. If external memory accesses are not currently required in the system, the 
HM=0 mode can be used. The device continues to operate normally unless an 
off-chip access is required by an instruction, at which time the processor halts 
until the hold is removed. When the HOLD signal initiates the power-down 
mode, power-down mode is terminated when HOLD goes inactive. HOLD 
does not stop operation of on-chip peripherals (i.e., on-chip timers and serial 
ports continue to operate, regardless of the level on HOLD or the condition of 
the HM bit). 
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3.7 Parallel Logic Unit (PLU) 


The parallel logic unit (PLU) can directly set, clear, test, or toggle multiple bits 
in a control/status register or any data memory location. The PLU, shown in 
the block diagram in Figure 3—16, provides a direct logic operation path to data 
memory values without affecting the contents of the accumulator or product 
register. It can be used to set or clear multiple bits in a control register or to test 
multiple bits in a flag register. 


Figure 3-16. Parallel Logic Unit Block Diagram 


The PLU executes a read-modify-write operation on data stored in data space. 
The PLU operation begins with the fetching of one operand from data memory 
space and the fetching of the second from either long immediate on the pro- 
gram bus or the dynamic bit manipulation register (DBMR). Then, the PLU ex- 
ecutes a logical operation defined by the instruction on the two operands. The 
result is written to the same data memory location from which the first operand 
was fetched. 


The PLU allows the direct manipulation of bits in any location in data memory 
space. This direct bit manipulation is done by ANDing, ORing, XORing, or 
loading a 16-bit long immediate value to a data location. For example, to use 
AR1 for circular buffer 1 and AR2 for circular buffer 2 but not enable the circular 
buffers, initialize the circular buffer control register (CBCR) by executing this: 


SPLK #021h,CBCR ;Store peripheral long immediate (DP = 0). 
To later enable circular buffers 1 and 2, execute 
OPL #088h,CBCR ;Set bit 7 and bit 3 in CBCR. 
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Test for individual bits in a specific register or data word via the BIT instruction; 
however, test against a pattern with the CPL (compare parallel long immedi- 
ate) instruction. If the data value is equal to the long immediate value, then the 
TC bit is set to 1. The TC bit is set if the result of any PLU instruction is zero. 


The bit set, clear, and toggle functions can also be executed with a 16-bit dy- 
namic register value instead of the long immediate value. This is done with the 
following three instructions: APL (AND DBMR register to data), OPL (OR 
DBMR register to data), and XPL (XOR DBMR register to data). 


The TC bit in ST1 is also set by the APL, OPL, XPL instructions if the result of 
the PLU operation (value written back into data memory) is zero. This allows 
bits to be tested and cleared simultaneously. For example, 


APL #0OFFO0Oh, TEMP :Clear low byte and check for 
sbits set in high byte. 

BCND HIGH _BITS_SET,NTC ;If bits active in high byte, 
*then branch. 


or 


XPL #1, TEMP *Toggle bit 0. 
BCND BIT SET,TC ;If bit was set, branch. If not, bit set now. 


In the first example, the low byte of a flag word is cleared while the high byte 
is checked for any active flags (bits = 1). If none of the flags in the high byte 
are set, then the resulting APL operation yields a zero to TEMP and the TC bit 
is set to 1. If any of the flags in the high byte are set, then the resulting APL 
operation yields a nonzero value to TEMP and the TC bit is set to 0. Therefore, 
the conditional branch (BCND) following the APL instruction branches if any 
of the bits in the high byte are nonzero. The second example tests the flag. If 
low, it is set high; if high, it is cleared and the branch is taken. The PLU instruc- 
tions can operate anywhere in data address space, so they can be used to op- 
erate with flags stored in RAM locations as well as control registers for both 
on- and off-chip peripherals. 
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3.8.1 


Interrupts 


Reset 


The 'C5x core CPU supports sixteen user-maskable interrupts (INT16—INT1). 
However, each ’C5x DSP does not necessarily use all 16. For example, the 
‘C50, C51, and ’C53 use only nine of these interrupts (the others are tied high 
inside the device). Interrupts can be generated by the serial ports (RINT, XINT, 
TRNT, and TXNT), by the timer (TINT), and by the software interrupt (TRAP 
and INTR) instructions. The reset (RS) interrupt has the highest priority, and 
the INT16 interrupt has the lowest priority. 


Reset (RS) is a nonmaskable external interrupt that can be used at any time 
to put the ’C5x into a known state. Reset is typically applied after power-up 
when the machine is in an unknown state. 


Driving the RS signal low causes the ’C5x to terminate execution and forces 
the program counter to zero. RS affects various registers and status bits. At 
power-up, the state of the processor is undefined. For correct system opera- 
tion after power-up, a reset signal must be asserted low for several clock 
cycles so that data lines are put into the high-impedance state and address 
lines are driven low (see Appendix A for specific timings). The device will latch 
the reset pulse and generate an internal reset pulse long enough to guarantee 
a reset of the device. Several clock cycles after deasserting reset (see Appen- 
dix A), the reset vector at program address zero is fetched. 


When the R&S signal is received, the following actions occur: 


1) Alogic 0 is loaded into the CNF (configuration control) bit in status register 
ST1, mapping dual-access RAM block 0 into data address space. 


2) The program counter (PC) is set to 0. The address bus (lines A15 — AO) 
is unknown while R$ is low. IF HOLD is asserted while RS is low, HOLDA 
is generated. In this case, the address lines are placed into a high-impe- 
dance state until HOLD is brought back high. 


3) All interrupts are disabled by setting the INTM bit (interrupt mode) to 1; 
note that RS and NMI are nonmaskable. The interrupt flag register (IFR) 
is cleared. 


4) Status bits are set as follows: 
O0—-OV, 1—XF, 1—SXM, 0—PM, 1—HM, 0O-— BRAF, 
0-> TRM, 0-> NDX, 0->CENB1, 0->CENB2, 0 -—-I!PTR, 
0—-OVLY, O0—AVIS, O—RAM, O-— BIG, O-—-CNKF, 
1—INTM, MP/MC (Pin) - PMST (MP/MC), and 1—C, 


Note that the remaining status bits remain undefined and should be initial- 
ized appropriately. 
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5) The global memory allocation register (GREG) is cleared to make all 
memory local. 


6) The repeat counter (RPTC) is cleared. 


7) ThelACK (interrupt acknowledge) signal is generated in the same manner 
as a maskable interrupt. 


8) Asynchronized reset (SRESET) signal is sent to the peripheral circuits to 
initialize them. See subsection 5.1.3 for peripheral reset information. 


Execution starts from location 0 of program memory when the RS signal is tak- 
en high. Note that if HOLD is asserted while RS is low, normal reset operation 
occurs internally, but all buses and control lines remain in a high-impedance 
state and HOLDA is asserted, as shown in Figure 3—17(a) and (b). However, 
if RS is asserted while HOLD/HOLDA are low, the CPU comes out of the hold 
mode momentarily by deasserting HOLDA. This condition should be avoided. 
Upon release of HOLD and RS, execution starts from location zero. 
Figure 3-17 (a) and (b) shows two valid ways of exiting reset and hold. 


Figure 3-17. RS and HOLD Interaction 
a) 


3.8.2 Interrupt Operation 


This subsection explains interrupt organization and management. Vector rela- 
tive locations and priorities for all internal and external interrupts are shown in 
Table 3-12. 
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The TRAP instruction (software interrupts) is not prioritized but is included 
here because it has its own vector location. Each interrupt address has been 
spaced apart by two locations so that branch instructions can be accommo- 
dated in those locations. To make vectors stored in ROM reprogrammable, 
use the following code: 


LAMM TEMPO *ACC = ISR address. 
BACC *Branch to ISR. 


TEMPO resides in B2 and holds the address of the interrupt service routine 
(ISR). Note that the ISR addresses must be loaded into B2 before interrupts 
are enabled. Further information regarding interrupt operation, with respect to 
specific devices in the ’C5x generation, is located in Chapter 5, Peripherals. 


The interrupt vectors can be remapped to the beginning of any 2K-word page 
in program memory. The interrupt vector address is generated by concatenat- 
ing the IPTR bits of the PMST with the interrupt vector number (1—16) shifted 
by one as shown in Figure 3—18. 


Table 3—12. Interrupt Locations and Priorities 


Priority 


Location 


1 highesd 


: 


4 


wh, awd, oh, awk oh —s —_, — 


4 


10 
12 
14 
16 
18 


z 


user interrupt #11 
user interrupt #12 


7 
10 
11 
12 
13 
14 
15 
16 
17 

N/A 


20 

22 

a 
Tre [28 | 1 
wr [28 | 10 
mrs [30 
mre [ae 
a 
a 


t The interrupt numbers here do not correspond to any specific ‘C5x device. The definitions of the 
interrupts, specific to particular ’C5x devices, are covered in Chapter 5. 
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Figure 3-18. Interrupt Vector Address Generation 


IPTR = 00001 INT =5 
ae / \ 
Vector [O 0 0 O] {1/0 0 0) (1 0 1/0 
Bt [15 14 13 12 10 9 8 (3 2 1/0! 
Upon reset, the IPTR bits are all set to zero, thus mapping the vectors to page 
zero in program memory space. This means the reset vector always resides 
at zero. The interrupt vectors can be moved to another location by loading a 


nonzero value into the IPTR bits. For example, the interrupt vectors can be 
moved to start at location 0800h by loading the IPTR with 1. 


When an interrupt occurs, a flag is activated in the 16-bit interrupt flag register 
(IFR). This happens regardless of whether the interrupt is enabled or disabled. 
Each interrupt is stored in the IFR until it is recognized by the CPU. Any of the 
following four events clears the interrupt flag: 


1) Device reset (R&S is active low), 

2) Program takes the interrupt trap, 

3) Program writes a one to the appropriate bit in IFR, or 

4) Execution of the INTR instruction with the appropriate interrupt number. 


The IFR is located at address 6 in data memory space and can be read to iden- 
tify active interrupts and written to clear interrupts. 


A logic one in an IFR bit position indicates a pending interrupt. A one can be 
written to a specific bit to clear the corresponding interrupt. All pending inter- 
rupts can be cleared by writing the current contents of the IFR back into the 
IFR. The following example clears these two vectors without affecting any oth- 
er flags that may have been set: 


SPLK #5,IFR ;Clear flags for INT1 and INT3. 


An interrupt flag is automatically cleared when the corresponding interrupt trap 
is taken. When the CPU accepts the interrupt, it jams the instruction bus with 
an INTR instruction. This instruction forces the PC to the appropriate address 
and fetches the soft vector. While fetching the first word of the soft vector, it 
generates an interrupt acknowledge ([ACK) signal that clears the appropriate 
interrupt flag bit. The number of the specific interrupt being taken is indicated 
by address bits A1 — A5 on the falling edge of IACK. If the interrupt vectors re- 
side in on-chip memory, the device should be operating in address visibility 
mode (AVIS = 0) for the interrupt number to be decoded. A hardware reset (RS 
is active low) clears all pending interrupt flags. If an interrupt occurs while the 
device is in HOLD and HM = 0, the address will not be present when the [ACK 
goes active low. 


3-56 Architecture 


Interrupts 


The ’C5x has a memory-mapped interrupt mask register (IMR) for masking ex- 
ternal and internal interrupts. A 1 in bit positions 15 through 0 of the IMR en- 
ables the corresponding interrupt, provided thatINTM =0. The IMR is accessi- 
ble with both read and write operations. Note that neither NMI nor R& is in- 
cluded in the IMR; therefore, the IMR has no affect on the nonmaskable inter- 
rupt or reset. 


The INTM (global enable) bit, which is bit 9 of status register STO, enables or 
disables all interrupts. INTM = 0 enables all the unmasked interrupts, and 
INTM = 1 disables these interrupts. The INTM is set to 1 automatically when 
an interrupt trap is taken. If the interrupt service routine is exited via the RETE 
instruction (return from interrupt with automatic re-enable), then the INTM bit 
is re-enabled (set to zero). It can also be set to 1 with a hardware reset (RS 
is low) or by executing a disable interrupt (SETC INTM) instruction. This bit is 
reset to a zero by executing the enable interruptinstruction (CLRC INTM). The 
INTM does not actually modify the IMR or IFR. 


The interrupt latency of ’C5x depends on the current contents of the pipeline. 
The device always completes all instructions in the pipeline before executing 
the soft vector. The following example, Example 3-3, illustrates the minimum 
latency from the time an interrupt occurs externally to the interrupt acknowl- 
edge (IACK). The minimum interrupt acknowledge time is defined as 8 cycles: 


[J 3 cycles to externally synchronize the interrupt 
[.} 1 cycle for the interrupt to be recognized by the CPU 
_.} 4 cycles to execute the INTR instruction and flush the pipeline 


On the ninth cycle, the interrupt vector is fetched and the JACK is generated. 


Example 3—3.Minimum Interrupt Latency 


Interrupt occurs 
before the fetch of 
this instruction { 


Fetch Mainl Main2 
Decode Mainl 
Read 


Execute 


Interrupt This instruction will be 
written to refetched after return from 
IFR i { interrupt 
Main3 Main4 MainS Main6 Dummy Dummy Dummy Vecl Vec2 Dummy Dummy ISR1 
Main2 Main3 Main4 Main5 INTR Dummy Dummy Dummy Vecl Vec2 Dummy Dummy 
Mainl Main2 Main3 Main4 Main5 INTR Dummy Dummy Dummy Vecl Vec2 Dummy 


Mainl Main2 Main3 Main4 Main5 INTR Dummy Dummy Dummy Vecl Vec2 


t Interrupt t INTR jammed t IACK 
latched external into the pipe- generated 
to the CPU line here 


The maximum latency is a function of what is in the pipeline. Multicycle instruc- 
tions add additional cycles to empty the pipeline. This applies to instructions 
that are extended via wait-state insertion on memory accesses. The wait 
states required for interrupt vector accesses also affect the latency. The repeat 
next instruction N times (RPT and RPTZ) also locks out interrupts (including 
NMI, but not reset), and the repeated instruction completes all executions be- 
fore allowing the interrupt to execute. This is to protect the context of the re- 
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peated instructions because when repeated, the instructions run more parallel 
operations in the pipeline, and the context of these additional parallel opera- 
tions cannot be saved in an ISR. The HOLD function takes precedence over 
interrupts and also can delay the interrupt trap. If an interrupt happens during 
an active-HOLD state, the interrupt is taken at the completion of the HOLD 
state, that is, when HOLDA is deasserted. However, if the processor is in con- 
current hold mode (HM bit of ST1 is 0) and the interrupt vector table is located 
in internal memory, then the CPU takes the interrupt, regardless of HOLD sta- 
tus. 


Interrupts cannot be processed between CLRC INTM and the next instruction 
in a program sequence. For example, if an interrupt occurs during an CLRC 
INTM instruction execution, the device always completes CLRC INTM as well 
as the following instruction before the pending interrupt is processed. This en- 
sures that a return (RET) can be executed in an ISR before the next interrupt 
is processed—thus protecting against PC stack overflow. If the ISR is exited 
via a RETE (return from ISR with enable), the CLRC INTM is unnecessary. Of 
course, after a SETC INTM instruction, the following instruction will not be in- 
terrupted. 


3.8.3 Interrupt Context Save 


When an interrupt trap is executed, certain strategic registers are saved auto- 
matically. When the return from interrupt instruction (RETE or RETI) is ex- 
ecuted, these registers are automatically restored. The program counter (PC) 
is saved on an 8-deep hardware stack. This stack is also used for subroutine 
calls. Therefore, the device supports subroutine calls within the interrupt ser- 
vice routine (ISR) as long as the 8-level stack is not exceeded. Also, there is 
a one-deep stack (or shadow registers) for each of the following registers: 
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ACC accumulator 

ACCB accumulator buffer 

PREG product register 

STO status register 0 

ST1 status register 1 

PMST processor mode status register 
TREGO temporary register for multiplier 
TREG1 temporary register for shift count 
TREG2 temporary register for bit test 
INDX indirect address index register 
ARCR auxiliary register compare register 


When the interrupt trap is taken, all these registers are each pushed onto a 
one-level stack, with the exception of the XF bitin ST1 and the INTM bitin STO. 
On an interrupt, the INTM bit is always set to 1 to disable interrupts. The values 
in the registers at the time of the trap are still available to the ISR but are also 
protected in the stack. The stack is popped when the return from interrupt 
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(RETI or RETE) is executed. This system allows the CPU to be used without 
requiring context save and restore overhead in the ISR. 


With only a one-level stack for the above 11 registers, this hardware does not 
support nested interrupts. In most cases, this is not a problem, because with- 
out the context save and restore overhead, serial processing of the interrupts 
is so efficient that nested interrupt handling is less effective. If the application 
does require nested interrupts, they can be handled by using a software stack. 
Software compatibility with the ‘C25 is maintained because the RET instruc- 
tion, used to return from the ISR on a’C25, does not pop these registers. Inter- 
rupts are not re-enabled unless an RETE or a CLRC INTM instruction is ex- 
ecuted. 


In acase where the ISR needs to modify values in these registers with respect 
to the interrupted code, these registers can be popped from the stack as shown 
in the following example and modified: 


ISR 
LACC #ISR_RE ENTER ;ACC = address of reentry point. 
PUSH ;Top of stack = reentry point. 
RETI ;Pop all the stacks. 


ISR_RE ENTER 


CLRC INTM 
RET ;Return to interrupted code. 


In the example, the address of the reentry point within the ISR is pushed onto 
the PC stack. The RETI instruction pops all the stacks, including the PC stack, 
and resumes execution. At the end of the ISR, a standard return is executed 
because the stack is already popped. 


Not all of the 16 core CPU interrupts are necessarily used on any given 'C5x 
device. The vectors for the interrupts not tied to specific external pins or inter- 
nal peripherals can be used as software interrupts. To use the corresponding 
interrupt vectors as software traps with full context save and restore, execute 
the INTR instruction with the appropriate interrupt number as an operand. 
These traps are protected from other interrupts in the same way the ISR is pro- 
tected; all interrupts are globally masked via the INTM bit. To execute the con- 
text restore, these trap routines must be exited via the RETI or RETE instruc- 
tion. For example, 


INTR 15 ;Software trap to address O1Eh. 


In this example, the processor traps to the vector relatively located at 01Eh. 


3.8.4 Nonmaskable Interrupt 


The core of the ’C5x has two nonmaskable interrupts, RS (reset) and NMI. Re- 
set is discussed in subsection 3.8.1 NMI is a soft reset. It is different from a 
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standard interrupt because it is not maskable, and it does not invoke the auto- 
matic context save. The context save is not invoked, because it is possible to 
take the NMI even during an interrupt service routine. In addition, interrupts are 
globally disabled during an NMI instruction. The NMI is different from reset in 
that it does not affect any of the modes of the device. Note that some 'C5x de- 
vices may not make the NMI available externally. The NMI is also delayed by 
multicycle instructions (including RPT) and by HOLD, as described in subsec- 
tion 3.8.2. The NMI trap can also be initiated via software using the NMI in- 
struction. This instruction forces the PC to the NMI trap location. 
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The ’C5x instruction set supports numerically intensive signal-processing op- 
erations as well as general-purpose applications, such as multiprocessing and 
high-speed control. The instruction set is a superset of the ’'C1x and ’C2x in- 
struction sets and is source-code upward compatible with both devices. This 
chapter describes the assembly language instructions for the ’C5x digital sig- 
nal processor. Included in this chapter are the following major topics: 


Topic Page 
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4.1 Memory Addressing Modes 


4.1.1 
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The ’C5x instruction set provides six basic memory addressing modes: 


Direct addressing mode 

Indirect addressing mode 

Immediate addressing mode 

Dedicated register addressing mode 
Memory-mapped register addressing mode 
Circular addressing mode 


OUOUOU 


Both direct and indirect addressing can be used to access data memory. Direct 
addressing concatenates seven bits of the instruction word with the nine bits 
of the data memory page pointer to form the 16-bit data memory address. Indi- 
rect addressing accesses data memory through one of eight auxiliary regis- 
ters. In immediate addressing, the data is based on a portion of the instruction 
word(s). Two types of immediate addressing modes are available: short and 
long. In short immediate addressing, an 8-/9-/13-bit operand is included in the 
instruction word. Long immediate addressing mode uses as its operand a 
16-bit word following the instruction. Dedicated register addressing refers to 
the block move instructions in which the BMAR register addresses program 
or datamemory and the parallel logic unit (PLU) instructions in which operands 
are obtained from the DBMR register. Memory-mapped register addressing 
mode is used to load and store memory-mapped registers. Circular address- 
ing is an additional mode of indirect addressing that automatically wraps to the 
beginning of a block of data when the end of the block is reached. The following 
subsections describe each addressing mode and give the opcode formats and 
some examples for each mode. 


Direct Addressing Mode 


In the direct memory addressing mode, the instruction contains the lower 
seven bits of the data memory address (dma). This field is concatenated with 
the nine bits of the data memory page pointer (DP) register to form the full 
16-bit data memory address. Thus, the DP register points to one of 512 possi- 
ble 128-word data memory pages, and the 7-bit address in the instruction 
points to the specific location within that data memory page. The DP register 
is loaded by using the LDP (load data memory page pointer) or the LST #0 
(load status register STO) instructions. 
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Note: 


The data page pointer is not initialized by reset and, therefore, is undefined 
after power-up. The ’C5x development tools, however, utilize default val- 
ues for many parameters, including the data page pointer. Because of this, 
programs that do not explicitly initialize the data page pointer may execute 
improperly, depending on whether they are executed on a ’C5x device or 
with a development tool. Thus, it is critical that all programs initialize the 


data page pointer in software. 
a nO ET | 


Figure 4-1 illustrates how the 16-bit data address is formed. 


Figure 4—1. Direct Addressing Block Diagram 


7 LSBs From Instruction Register (IR) 


16-Bit Data Address 


The direct addressing format is as follows: 
15 14 13 12 1 #10 9 8 7 


6 5 4 3 2 1 O 
Se 


Bits 15 through 8 contain the opcode. Bit 7 = 0 defines the addressing mode 
as direct, and bits 6 through O contain the data memory address (dma). 


Example of direct addressing format: 


ADD 9h,5 ;:The contents of data address 9h is 
sleft—shifted 5 bits and added to the 
scontents of the accumulator. 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 


foot oo 7 0 tfo 
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4.1.2 


Modes 


The opcode of the ADD 9h,5 instruction is 25h and appears in bits 15 through 
8. The shift count of 5 appears in bits 11 through 8 of the opcode. The data 
memory address 09h appears in bits 6 through 0. 


Indirect Addressing Mode 


Eight auxiliary registers (ARO—-AR7) provide flexible and powerful indirect ad- 
dressing on the 'C5x. To select a specific auxiliary register, load the auxiliary 
register pointer (ARP) with a value from 0 through 7, designating ARO through 
AR7, respectively (see Figure 4—2). 


Figure 4—2. Indirect Addressing Block Diagram 
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Auxiliary Registers 


, ARO (16) 
| ARP (3) 3 AR1 (16) 
(ARP = 2) >| Are (16) 
| ARS (16) | | 
|  AR4 (16) 
| ARS (16) 
| AR6 (16) 
| AR7 (16) 


16 


=s 

oO) 
par 
oO) 


ARAU (16) 
16-Bit Data Address 


The contents of the auxiliary registers may be operated upon by the auxiliary 
register arithmetic unit (ARAU), which implements unsigned16-bit arithmetic. 
The ARAU performs auxiliary register arithmetic operations in the decode 
phase of the pipeline. This allows the address to be generated before the de- 
code phase of the next instruction. The AR is incremented or decremented 
after it is used in the current instruction. 


In indirect addressing, any location in the 64K data memory space can be ac- 
cessed via a 16-bit address contained in an auxiliary register. The LAR instruc- 
tion loads the address into the register. The auxiliary registers on the 'C5x may 
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be modified by ADRK (add to auxiliary register short immediate) or SBRK (sub- 
tract from auxiliary register short immediate); they may also be modified by the 
MAR (modify auxiliary register) instruction or, equivalently, by the indirect ad- 
dressing field of any instruction supporting indirect addressing. AR(ARP) de- 
notes that the auxiliary register is to be selected by ARP. The auxiliary registers 
can also be loaded via the data bus by using memory-mapped writes to the 
auxiliary registers. The following instructions can write to the memory-mapped 
auxiliary registers: APL, BLDD, LMMR, OPL, SACH, SACL, SAMM, SMMR, 
SPLK, and XPL. Be careful when using these memory-mapped loads of the 
auxiliary registers because in this case the memory-mapped auxiliary regis- 
ters are modified in the execute phase of the pipeline. This causes a pipeline 
conflict if one of the next two instruction words modifies that auxiliary register. 
For further information on the pipeline and possible pipeline conflicts, see sub- 
section 3.6.2. 


The following symbols are used in indirect addressing, including bit-reversed 
(BR) addressing: 
. Contents of AR(ARP) are used as the data memory address. 


* Contents of AR(ARP) are used as the data memory address and de- 
cremented after the access. 


*+ Contents of AR(ARP) are used as the data memory address and in- 
cremented after the access. 


*0- Contents of AR(ARP) are used as the data memory address, and the 
contents of INDX are subtracted from it after the access. 


*O+ Contents of AR(ARP) are used as the data memory address, and the 
contents of INDX are added to it after the access. 


*BRO— Contents of AR(ARP) are used as the data memory address, and the 
contents of INDX are subtracted, with reverse carry (rc) propagation, 
from it after the access. 


*BRO+ Contents of AR(ARP) are used as the data memory address, and the 
contents of INDX added, with reverse carry (rc) propagation, to it after 
the access. 


There are two primary types of indirect addressing with indexing: 
(.} Regular indirect addressing with increment or decrement, and 


[.) Indirect addressing with indexing based on the value of INDX: 
MM Indexing by adding or subtracting the contents of INDX, or 
Mi Indexing by adding or subtracting the contents of INDX with the carry 
propagation reversed (for FFTs on the 'C5x). 


In either case, the contents of the auxiliary register pointed to by the ARP regis- 
ter are used as the address of the data memory operand. Then, the ARAU per- 
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forms the specified mathematical operation on the indicated auxiliary register. 
Additionally, the ARP may be loaded with a new value. All indexing operations 
are performed on the current auxiliary register in the same cycle as the original 
instruction decode phase of the pipeline. 


Indirect auxiliary register addressing allows for post-access adjustments of the 


- auxiliary register pointed to by the ARP. The adjustment may be an increment 


or decrement by one or may be based upon the contents of the INDX register. 
To maintain compatibility with the ’C2x devices, set the NDX bit in the PMST 
register to 0. In the ’C2x architecture, the current auxiliary register can be in- 
cremented or decremented by the value in the ARO register. When the NDX 
bit is set to 0, every ARO modification or LAR write also writes the ARCR and 
INDX registers with the same value. Subsequent modifications of the current 
auxiliary registers using indexed addressing will use the INDX register, there- 
fore maintaining compatibility with existing 'C2x code. The NDX bit is set to 0 
at reset. 


Bit-reversed addressing modes on the ’C5x allow efficient I/O to be performed 
by the resequencing of data points in a radix-2 FFT program. The direction of 
Carry propagation in the ARAU is reversed when this mode is selected, and 
INDX is added to/subtracted from the current auxiliary register. Typical use of 
this addressing mode requires that INDX first be set to a value corresponding 
to one-half of the array’s size, and that AR(ARP) be set to the base address 
of the data (the first data point). 


Indirect addressing can be used with all instructions except those with immedi- 
ate operands or with no operands. The indirect addressing format is as follows: 


15 14 13 12 11109 8 7 6 5 4 3 2 #1 =#0 


Bits 15 through 8 contain the opcode, and bit 7 = 1 defines the addressing 
mode as indirect. Bits 6 through 0 contain the indirect addressing control bits. 


Bit 6 contains the increment/decrement value (IDV). The IDV bit determines 
whether the INDX register will be used to increment or decrement the current 
auxiliary register. If bit6 = 0, an increment or decrement (if any) by one occurs 
to the current auxiliary register. If bit 6 = 1, the INDX register is added to or sub- 
tracted from the current auxiliary register as defined by bits 5 and 4. 


Bits 5 and 4 control the arithmetic operation to be performed with AR(ARP) and 
the INDX register. When set, bit 5 indicates that an increment is to be per- 
formed. If bit 4 is set, a decrement is to be performed. Table 4—1 shows the 
correspondence of bit pattern and arithmetic operation. 
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Table 4—1. Indirect Addressing Arithmetic Operations 


Arithmetic Operation 
6 


No operation on AR(ARP) 

AR(ARP) - 1 — AR(ARP) 

AR(ARP) + 1 —> AR(ARP) 

Reserved 

AR(ARP) — INDX —» AR(ARP) [reverse carry propagation] 
AR(ARP) — INDX — AR(ARP) 

AR(ARP) + INDX -» AR(ARP) 

AR(ARP) + INDX — AR(ARP) [reverse carry propagation] 


=-~o-0O-0O+ Of] & 


0 
0 
0 
0 
1 
1 
1 
1 


—~—-i§ OO $$ =| OO 


Bit 3 and bits 2 through 0 control the auxiliary register pointer (ARP). Bit 3 
(NAR) determines whether new value is loaded into the ARP. If bit 3 = 1, the 
contents of bits 2 through O (Y = next ARP) are loaded into the ARP. If bit 3 = 
0, the contents of the ARP remain unchanged. If the ARP is loaded with a new 
value, the old value is loaded into the auxiliary register buffer (ARB) in the ST1 
status register. 


Table 4—2 shows the bit fields, notation, and operation used for indirect ad- 
dressing. 


Table 4—2. Bit Fields for Indirect Addressing 


Instruction Field Bits Operation 
15 - 876543210 


< Opcode -~10000+<«+Y- No manipulation of ARx/ARP 


<— Opcode -~10001<«yY-— PY | Y- ARP 


- Opcode 7~10010<Y- AR(ARP) — 1 > AR(ARP) 


< Opcode >-~1001%1+¢«+Y- * AR(ARP) — 1 — AR(ARP) 
Y — ARP 
) 


< Opcode -10100<«Y- Cee AR(ARP) + 1 — AR(ARP) 


« Opcode 7~10101+«Y-— + Y AR(ARP) + 1 AR(ARP) 
Y — ARP 


«- Opcode -11000+Y- AR(ARP) - rcINDX — AR(ARP) t 


- Opcode ~11001<+Y-> *BRO—Y AR(ARP) — rclNDX — AR(ARP) 
Y -» ARP t 


*V 
—Y 
*+ 
< Opcode -~11010+«+Y-= AR(ARP) — INDX —> AR(ARP) 
*O+ 


+ Opcode ~711011<Y- *O—Y AR(ARP) — INDX — AR(ARP) 
Y + ARP 


+ Opcode -~11100+«Y- 0+ | AR(ARP) + INDX —> AR(ARP) 


« Opcode 7~11101<Y- *04,Y AR(ARP) + INDX — AR(ARP) 
Y —» ARP 


<— Opcode >->11110<«+Y- AR(ARP) + rcINDX — AR(ARP) t 


< Opcode >~1111%1<<«Y-o> *BRO+,Y AR(ARP) + rclINDX -> AR(ARP) 
Y — ARPT 


Tt BR = bit-reversed addressing mode and rc = reverse carry propagation 
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Example 1 


Example 2 


Example 3 


Example 4 


Example 5 


Example 6 


Example 7 
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The CMPR (compare auxiliary register with ARCR) and TC/NTC conditions fa- 
cilitate conditional branches, calls, returns, or conditional executes according 
to comparisons between the contents of ARCR and the contents of AR(ARP). 
To maintain compatibility with the TMS320C2x devices, set the NDX bit in the 
PMST register to 0. In the ’'C2x architecture, the auxiliary register compare 
function is performed by comparing ARO with the current auxiliary register. 
When the NDxX bit is set to 0, every load to ARO loads the ARCR register with 
the same value. Subsequent compares of the current auxiliary register will use 
the ARCR register, therefore maintaining compatibility with existing 'C2x code. 
The NDX bit is set to 0 at reset. The auxiliary registers may also be used for 
temporary storage via the load and store auxiliary register instructions, LAR 
and SAR, respectively, or via any instruction that can load and store the 
memory-mapped auxiliary registers. 


The following examples illustrate the indirect addressing format: 
ADD *+,8 


Add to the accumulator the contents of the data memory address defined by 
the contents of the current auxiliary register. This data is left-shifted 8 bits be- 
fore being added. The current auxiliary register is autoincremented by one. 
The instruction word is O28A0h. 


ADD *,8 
As in Example 1, but with no autoincrement; the instruction word is O2880h. 
ADD *-,8 


As in Example 1, except that the current auxiliary register is decremented by 
one; the instruction word is 02890h. 


ADD *0+,8 


As in Example 1, except that the contents of register INDX are added to the 
current auxiliary register; the instruction word is O28EOh. 


ADD *0-,8 
Asin Example 1, except that the contents of register INDX are subtracted from 
the current auxiliary register; the instruction word is O28D0Oh. 

ADD *+,8,AR3 


As in Example 1, except that the auxiliary register pointer (ARP) is loaded with 
the value 3 for subsequent instructions; the instruction word is O28ABh. 


ADD *BRO-,8 
The contents of register INDX are subtracted from the current auxiliary regis- 
ter, with reverse carry propagation; the instruction word is O28COh. 


Assembly Language Instructions 


Memory Addressing Modes 


Example8 ADD *BRO+,8 


The contents of register INDX are added to the current auxiliary register, with 
reverse Carry propagation; the instruction word is O28FOh. 


4.1.3. Immediate Addressing Mode 


In immediate addressing, the instruction word(s) contains the value of the im- 
mediate operand. The ’C5x has both single-word (8-bit, 9-bit, and 13-bit con- 
stant) short immediate instructions and two-word (16-bit constant) long imme- 
diate instructions. In short immediate instructions, the immediate operand is 
contained within the instruction word itself. In long immediate instructions, the 
word following the instruction word is used as the immediate operand. 


The ’C5x instructions listed in Table 4-3 support immediate addressing. 


Table 4—3. Instructions That Support Immediate Addressing 


8-Bit Immediate 9-Bit Immediate 13-Bit Immediate 16-Bit Immediate 


Example code for the RPT instruction with short immediate addressing: 


RPT #99 sExecute the instruction after RPT 100 times. 


In this example, the immediate operand is contained as a part of the RPT in- 
struction opcode. The instruction word format for RPT with short immediate 
- addressing is as follows: 


15 14 13 12 11109 8 7 6 5 4 3 2 1 0 


For long immediate instructions, the constant is a 16-bit value in the word fol- 
lowing the opcode. The 16-bit value can be optionally used as an absolute con- 
stant or as a 2s-complement value. 


Con an A AN a CO Rr fd RR Le Le COLL Led Lok LOL OL RCL ORS SL ERE ESE CE RET ORE CEE TE OOOCOT CORTON 


The following is an example code and the instruction word format for the RPT 
instruction with long immediate addressing: 


RPT #0FFFh ;Execute instruction after RPT 1000h times. 
15 14 13 12 11109 8 7 6 5 4 3 2 1 0 
1 0 1 1 1 1 1 0 1 1 0 O 0 1 0 O 


16-bit constant 


4.1.4 Dedicated Register Addressing 


Nine instructions in the 'C5x instruction set can use one of two special-purpose 
memory-mapped registers in the core CPU. These two registers are the block 
move address register (BMAR) and the dynamic bit manipulation register 
(DBMR). The APL, OPL, CPL, and XPL parallel logic unit (PLU) instructions 
use the contents of the DBMR register when an immediate value is not speci- 
fied as one of the operands. The BLDD, BLDP, and BLPD instructions can use 
the BMAR register to point at the source or destination space of a block move. 
The MADD and MADS also use the BMAR register to address an operand in 
program memory for a multiply-accumulate operation. 


The syntax for dedicated register addressing can be stated in one of two ways: 


1) Specifying BMAR by its predefined symbol as shown below: 


BLDD BMAR,DAT100 ;DP = 0. BMAR contains the value 200h. 
The contents of data memory location 200h are copied to data memory 


location 100 on the current data page. The opcode for this instruction is 
OAC64h. 


2) Excluding the immediate value from parallel logic unit instructions as 
shown below. The BMAR register is implied by the MADD and MADS in- 
struction mnemonics. 

OPL DAT10 ;DP = 6. DBMR contains the value OFFFOh. 
sAddress O30Ah contains the value Olh 

The contents of data memory location O30Ah are ORed with the contents 

of DBMR. The resulting value OFFF 1h is stored back in memory location 

O30Ah. The opcode for this instruction is 590Ah. 


4.1.5 Memory-Mapped Register Addressing 


Memory-mapped register addressing is used for modifying the 
memory-mapped registers without affecting the current data page pointer val- 
ue. In addition, any scratch pad RAM location or data page 0 can be modified 
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by using this addressing mode. Figure 4—3 illustrates how this is done by forc- 
ing the 9 MSBs of the data memory address to zero, regardless of the current 
value of the DP when direct addressing is used or of the current auxiliary regis- 
ter value when indirect addressing is used. The use of these instructions does 
not affect the contents of the DP. 


Figure 4—3. Memory-Mapped Register Addressing Block Diagram 


All bits Os 


7 LSBs from 
Instruction Register (IR) 
or Current Auxiliary Register 


16-Bits Memory-Mapped Register Address 


This addressing mode allows greater flexibility for dealing with 
memory-mapped registers. The overhead required to perform operations in- 
volving a memory-mapped register is greatly reduced because the data page 
pointer (DP) does not need to be modified before and after the operation. The 
following instructions operate in the memory-mapped register addressing 
mode: 


(_.} LAMM — Load accumulator with memory-mapped register 
(_.) SAMM -— Store accumulator in memory-mapped register 
[_.} LMMR — Load memory-mapped register 

J] SMMR -— Store memory-mapped register 


The following examples illustrate the use of these instructions in the direct and 
indirect addressing modes. 


LMMR CBCR,#0800h :DP = 6. Load CBCR memory—mapped register. 


The CBCR memory-mapped register is loaded with the value at location 
0800h. The instruction word for this instruction is 0891Eh, followed by the 
16-bit word O800h. 


SAMM *+ ;Store accumulator to PMST register. 


If the auxiliary register pointer ARP = 3 and auxiliary register AR3 = FFO7h, the 
contents of the accumulator is stored to the PMST register (address 07h) 
pointed at by the last 7 bits of AR3. The instruction word for this instruction is 
08890h. 
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Many algorithms such as convolution, correlation, and FIR filters can make 
use of circular buffers in memory. In these algorithms, a circular buffer is used 
to implement a sliding window, which contains the most recent data to be pro- 
cessed. The 'C5x supports two concurrent circular buffers operating via the 
auxiliary registers. The following five memory-mapped registers control the 
circular buffer operation: 


[) CBSR1 -— Circular Buffer One Start Register 
[} CBSR2 — Circular Buffer Two Start Register 
[j) CBER1 — Circular Buffer One End Register 

[} CBER2 -— Circular Buffer Two End Register 

[) CBCR-- -—Circular Buffer Control Register 


The 8-bit circular buffer control register enables and disables the circular buffer 
operation. The CBCR is defined as follows: 


Identifies which auxiliary register is mapped to circular buffer 1. 


Circular buffer 1, enable=1/disable=0. Set to O upon reset. 
Identifies which auxiliary register is mapped to circular buffer 2. 
Circular buffer 2, enable=1/disable=0. Set to 0 upon reset. 


In order to define circular buffers, the start and end addresses should first be 


loaded into the corresponding buffer registers; next, a value between the start 
and end registers for the circular buffer is loaded into an auxiliary register. The 
proper auxiliary register value is loaded, and the corresponding circular buffer 
enable bit is set in the control register. Note that the same auxiliary register can 
not be enabled for both circular buffers, or unexpected results occur. The algo- 
rithm for circular buffer addressing is as follows (note that the test of the auxilia- 
ry register value is performed before any modifications): 


lf (ARn = CBER) and (any AR modification), 
Then: ARn = CBSR. 
Else: ARn = ARn + step. 


In addition, note that if ARn=CBER and no AR modification occurs, the current 
AR is not modified and is still equal to CBER.Note that when the current auxilia- 
ry register = CBER, any AR modification (increment or decrement) will set the 
current AR = CBSR. The following examples illustrate the operation: 


splk #200h,CBSR1 ; Circular buffer start register 
splk #203h,CBER1 ; Circular buffer end register 
splk #0eh,CBCR ; Enable AR6 pointing to buffer 1 


lar ar6,#200h : Case l 
lacc * * AR6 = 200h 


Assembly Language Instructions 


lar ar6,#203h * Case 2 
* 


lacc * AR6 = 203h 
lar ar6,#200h * Case 3 
lacc *+ > AR6 = 201h 
lar ar6,#203h * Case 4 
lacc *+ * AR6 = 200h 
lar ar6,#200h > Case 5 
lacc *— > AR6 = 1lffh 
lar ar6,#203h > Case 6 
lacc *— > AR6 = 200h 
lar ar6,#202h > Case 7 
adrk 2 > AR6 = 204h 
lar ar6,#203h > Case 8 
adrk 2 * AR6 = 200h 


In circular addressing, the step is the quantity that is being added to or sub- 
tracted from the specified auxiliary register. Take care when using a step of 
greater than one to modify the auxiliary register pointing to an element of the 
circular buffer. lf an update to an auxiliary register generates an address out- 
side the range of the circular buffer, the ARAU does not detect this situation, 
and the buffer does not wrap around. Auxiliary register updates are performed 
as described in subsection 4.1.2. Note that there is a two-cycle latency be- 
tween configuring the circular buffer control registers and performing AR modi- 
fications due to the pipeline. 


Circular buffers can be used in increment- or decrement-type updates. For in- 
crementing the value in the auxiliary register, the value in CBER must be great- 
er than the value in CBSR. For decrementing the value in the auxiliary register, 
the CBSR register value must be greater than the value in the CBER register. 
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4.2 Instruction Set 


The 'C5x assembly language instruction set supports both DSP-specific and 
general-purpose applications. This section lists and groups the ’C5x instruc- 
tion set according to the following functional headings: 


Accumulator memory reference instructions 
Auxiliary registers and data page pointer instructions 
Parallel logic unit instructions 

T register, P register, and multiply instructions 
Branch instructions 

I/O and data memory operations 

Control instructions 


DOOUOOU 


Section 4.1 covers the addressing modes associated with the instruction set, 
and Section 4.3 describes individual instructions in more detail. 


4.2.1 Symbols and Abbreviations 


Table 4—4 lists symbols and abbreviations used in the instruction set summary 
(Table 4—4) and the individual instruction descriptions (Section 4.3). 
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Address 

Accumulator 

16-bit data memory address 

Auxiliary register pointer buffer 

Auxiliary register n (0 <n = 7) 

Auxiliary register pointer 

4-bit field specifying bit code 

Branch control input 

Block move address register 

Carry bit 

2-bit field specifying compare mode 
On-chip RAM configuration control bit 
Data memory address field 

Label assigned to data memory location n 
Dynamic bit manipulation register 

7-bit data memory address 

Data page pointer 

Format status bit 

Frame synchronization mode bit 

Hold mode bit 

Addressing mode bit 

Indirect addressing operands 

Interrupt mode flag bit 

Immediate operand field 

Long immediate operand field 

Microcall stack 

Indicates that nn represents a hexadecimal number 
Overflow bit 

Overflow mode bit 

Product register 

Port address n (0 sn < 65535 ) 

Program counter 

Prefetch counter 

Label assigned to program memory location n 
2-bit field specifying P register output shift code 
Program memory address 

3-bit field specifying auxiliary register 
Repeat counter 

4-bit left-shift code 

Status register n(n =0or 1) 
Sign-extension mode bit 

Temporary register n (n = 0, 1, or 2) 

Test control bit 

Top of stack 

Control bit to enable multiple TREGs 
Transmit mode bit 

XF pin status bit 

Is assigned to 

Absolute value of x 

User-defined items 

Optional items 

Contents of 

Alternative items; one of which must be entered 
Prefix of constants used in immediate addressing 
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Instruction Set Summary 


Table 4—5 is asummary of the instruction set for the 'C5x digital signal proces- 
sors. This instruction set is a superset of the 'C1x and ’C2x instruction sets. 


The instruction set summary is arranged according to function and is alphabet- 
ized within each functional grouping. The number of words that an instruction 
occupies in program memory is specified in column four of the table. Several 
instructions specify two values, separated by a slash mark ”/” for the number 
of words. Different forms of the instruction occupy a different number of words. 
For example, the ADD instruction occupies one word when the operand is a 
short immediate value or two words if the operand is a long immediate value. 
The number of cycles that an instruction requires to execute is in column four 
of the table. All instructions are assumed to be executed from internal program 
memory (RAM) and internal data dual-access memory. The cycle timings are 
for single-instruction execution, not for repeat mode. Additional information is 
presented in the Individual Instruction Descriptions in Section 4.3. Bold type- 
face indicates instructions that are new for the ’C5x instruction set. 


Section 4.4 includes a table that maps ’C2x instructions to ’C5x instructions. 
Note that the Texas Instruments 'C5x assembler accepts ’C2x instructions as 
well as ’C5x instructions. 
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Instruction Set 


Table 4—5. Instruction Set Summary 


Accumulator Memory Reference Instructions 


Description Cycles 
BS Absolute value of ACC Lo 4 


ADCB Add ACCB to ACC with carry ee 
ADD Add to ACC / 


Add ACCB to ACC pd 
L™ | 


2 (long immediate value specified) 
ADDB 
ADDC 
ADDS 
ADDT 


Add to ACC with carry pt | 


Add to low ACC with sign-extension suppressed 


1 

Add to ACC with shift specified by TREG1 
AND AND with ACC | 1 

2 (long immediate value specified) 
ANDB 
BSAR 
CMPL 
CRGT 


CRLT 


AND ACCB with ACC 
Barrel-shift ACC right 


Complement ACC 

Test for ACC > ACCB 

Test for ACC < ACCB 

Swap ACCB with ACC 
Load ACC with ACCB 
LACC Load ACC with shift 


Le | 

ee 

be 

a 

Eo! 

Lo! 

Le 

an (long immediate value specified 

LACL Load low word of ACC rane 
LACT | Load ACC with shift specified by TREG1 Pa of 
=e 

ba 

ee 

ee 

ea 

bod 

1 


1 (processor memory-mapped register) 
2 (peripheral memory-mapped registers) 


1 


Load ACC with contents of memory-mapped 
register 


z 


EG Negate accumulator 
Normalize contents of ACC 


OR with accumulator 


2 (long immediate value specified) 
RB OR ACCB with ACC 

ROL Rotate ACC left 

ROLB 


1/2 
1/2 
1 
1/2 
1 
1 
1 
1 
1 
1/2 
1 


1 
1 
Rotate ACCB and ACC left 
Rotate ACC right 

Rotate ACCB and ACC right 


1 


RORB 
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Table 4—5. Instruction Set Summary (Continued) 


[ao Ratan ose anede 
Mnemonic Words | Cycles | 


1 


Store ACC to memory-mapped register 1 1 (processor memory-mapped register) 
2 (peripheral memory-mapped registers) 


SATH Barrel-shift ACC right 0 or 16 bits as specified 1 
by TREG1 


SATL Barrel-shift ACC right 0 to 15 bits as specified 
by TREG1 


[SRRAGCE endACC HR 


1 
1 


SFLB 


| 2 (long immediate value specified) 


Subtract from ACC 


SUBB 
SUBC 
SUBS 


Subtract from ACC with borrow 


Conditional subtract 


Subtract from low ACC with sign-extension sup- 
pressed 


Subtract from ACC with shift specified by TREG1 


SUBT 
X 


4 1 
a = 
1/ 1 
1 

1/ 1 


Exclusive-OR with ACC 


2 
2 
2 (long immediate value specified) 


XORB Exclusive-OR ACCB with ACC 
ZA Zero low ACC and load high ACC with rounding _ 


Zero ACC and PREG 


Auxillary Registers and Data Page Pointer Instructions 
Words 


BB 
SFL 
SFR 
SUB 
OR 

LR 
ZAP 


Description 
ADRK — Add to ARn short immediate 


SS OL 
a a 
— 


M 


a En a 


Cycles 
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Table 4—5. Instruction Set Summary (Continued) 


Parallel Logic Unit Instructions 


APL AND DBMR or constant with data memory 1/2 1 (second operand DBMR) 
value 2 (second operand long immediate) 
CPL Compare DBMR or constant with data 1/2 1 (second operand DBMR) 
memory value 2 (second operand long immediate) 
OR DBMR or constant with data memory 1 (second operand DBM"? 
value 


2 (second operand long ‘:*:mediate) 
SPLK Store long immediate to data memory ne 
location 


XPL XOR DBMR or constant with data memory 1/2. | 1(second operand DBMR) _ 
value 2 (second operand long immediate) 


T Register, P Register, and Multiply Instructions 
Description Cycles 
APAC Add PREG to ACC 1 
LPH Load high PREG 
Load TREGO 
LTA Load TREGO & accumulate previous product 


LTD Load TREGO, accumulate previous product, and] 1 
move data 


Load TREGO & store PREG in accumulator 
Load TREGO and subtract previous product 


i 


il 


p 
S 
MAC Multiply and accumulate 
MACD Multiply and accumulate with data move 


Multiply and accumulate with source pointed at 
by BMAR 


Multiply and accumulate both with source 
pointed at by BMAR and with data move 


MPY Multiply 1/2 


2 (long immediate value specified) 
MPYA 
MPYS 
MPYU Multiply unsigned 

PAC Load ACC with PREG 

SPAC Subtract PREG from ACC © 

Store high PREG 

Store low PREG 

Set PREG output shift mode 

Square and accumulate previous product 


Multiply and accumulate previous product 
Multiply and subtract previous product 


” 


PL 


SQRA — 
Square and subtract previous product 


Zero product register 
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Table 4—5. Instruction Set Summary (Continued) 


Branch Instructions 
Mnemonic | Description Cycles 
BID] Branch unconditionally 4 (2 if delayed) 


BACC[D] _| Branch to address specified by ACC Loe | 4 (2 if delayed) 


BANZ[D] ~ Branch on ARn not-zero | (conditions true, 2 if delayed) | 


2 (conditions false) 


4 
BCND[D] | Branch conditionally Det least on true, 2 if delayed) 


at least one condition false) 


CALAID] Call subroutine indirect 1 


CALL[D] 


4 (2 if delayed) 
4 (2ifdelayed) 


4 (conditions true, 2 if delayed) 
2 (at least one condition false) 


Call subroutine 


| Callconditionally — 


INTR Soft interrupt 


RET[D] | Return from subroutine 
RETC[D] 


4 (2 if delayed) 


4 (conditions true, 2ifdelayed) 
2 (at least one condition false) 


Return conditionally 


RETE Return with context switch & global interrupt 


| enable 


RETI 
TRAP | | Software interrupt _ 


| Execute next instruction (s) conditionally 


| Nonmaskable interrupt ~ | 


Return with context switch | 


~ /O and Data Memory Operations 


Mnemonic Description _ Cycles 


BLDD Block move from data memory to data memory 1/2 2 peoand specified by BMAR) 
3 (operand specified by long 
| immediate) 
BLDP Block move from data memory to program | 
memory 
BLPD Block move from program memory to data 1/2 2 Ceerend specified by BMAR) — 
memory 3 (operand specified by long 
immediate) 
DMOV Data move in data memory | 
Input data from port 


LMMR Load memory-mapped register 


OUT Output data to port 


Store memory-mapped register 


TBLR Table read 
TBLW Table write | 


Panel memory-mapped et apa 
peripheral memory-mapped register 


G) PO 


oneal memory-mapped reuisten 
peripheral memory-mapped register 


G PO 
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Table 4—5. Instruction Set Summary (Continued) 


Control Instructions 


[Weemonie [Description Cd Woda Cyeles 
A 
[TOLER [Tae ntrinterupt—Tow powormode [1 _ 
[Reeperaion 
[Poptepafaackrodatamemoy | 

ez 

ae 

ame 


Repeat block 


| Repeat next Instruction and clear ACC and 
PREG 


| Set control bit 


ST Store status register 


1 


Note that all writes to external memory require two cycles. Reads require one 
cycle. Any write access immediately before or after a read cycle will require 
three cycles (refer to Appendix B). In addition, if two pipelined instructions try 
to access the same 2K-word long single-access memory block simultaneous- 
ly, one extra cycle is required. For example, the DMOV instruction, when re- 
peated with RPT, requires one cycle in the dual-access RAM but takes two 
cycles in the single-access RAM. Wait states are added to all external ac- 
cesses according to the configuration of the software wait-state registers de- 
scribed in Section 5.3. 
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4.3 Individual Instruction Descriptions 


This section furnishes detailed information on the instruction set for the ’C5x 
family; see Table 4—4, Instruction Set Summary, for a complete list of available 
instructions. Each instruction presents the following information: 


Assembler syntax 
Operands 
Opcode 
Execution 
Description 
Words 

Cycles 

Examples 


OUUOUUOU 


The EXAMPLE instruction is provided to familiarize you with the instruction 
format and explain the contents of the instruction manual pages. 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Direct: [label] EXAMPLE dma [,shiffj 

Indirect: [label] EXAMPLE {ina} [,shiftl,next ARP] 
Short Immediate: [label EXAMPLE [#k] 

Long Immediate: [/abef] EXAMPLE [#I/k] 


Each instruction begins with an assembler syntax expression. Labels may be 
placed either before the command (instruction mnemonic) on the same line or 
on the preceding line in the first column. An optional comment field may con- 
clude the syntax expression. Spaces are required between each field (label, 
command, operand, and comment fields). 


O<sdmas 127 

0 < pma s 65535 
0 <s next ARP <7 
Osk<s255 

Os Ik s 65535 
O<sshifts 15 


ind: {*|*+ |*-|*0+|*O-|*BRO+ | *BRO-} 


The above set of operands is not comprehensive; however, they are the most 
frequently used in the instruction set. Operands may be constants or assem- 
bly-time expressions referring to memory, I/O ports, register addresses, point- 
ers, shift counts, and a variety of other constants. 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 
Xx 'h %& %& K& KX KX KX KX RK KX KR KR KX XK X 


The opcode breaks down the various bit fields that make up each instruction 
word. 


(PC) +1 — PC 
(ACC) + (dma) —> ACC;0 ~ C 


Affected by OVM; affects OV and C. Not affected by SXM. 


The instruction operation sequence describes the processing that takes place 
when the instruction is executed. Conditional effects of status register speci- 
fied modes are also given. Those bits in the ’C5x status registers affected by 
the instruction are also listed. 


Instruction execution and its effect on the rest of the processor or memory con- 
tents are described. Any constraints on the operands imposed by the proces- 
sor or the assembler are discussed. The description parallels and supple- 
ments the information given by the execution block. 


This field specifies the number of memory words required to store the instruc- 
tion and its extension words. 
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nn en rn A ea an a nanan at ala anne ganar nara aan aca aa acsuans alaan ana dara an ian adnan ara anal gh ara an nanan Oak aden ae a i tae aa ra wan aera ee LL nL OL LL LLL LL ne Pn ee a DE ae 


Cycle Timings for a Repeat (RPT) Instruction 
a 


Operand Ext 


The table shows the number of cycles required for a given ’C5x instruction to 
execute in a given memory configuration when executed as a single instruc- 
tiion or in the repeat (RPT) mode. The column headings in the table indicate 
the program source location (PR, PDA, PSA, PE), defined as follows: 


PR The instruction executes from internal program ROM. 

PDA __ The instruction executes from internal dual-access program RAM. 
PSA The instruction executes from internal single-access program RAM. 
PE The instruction executes from external program memory. 


lf an instruction requires memory operand(s), row divisions in the table indi- 
cate the location(s) of the operand(s), as defined below: 


DARAM The operand is in internal dual-access RAM. 
SARAM The operand is in internal single-access RAM. 


Ext The operand is in external memory. 
ROM The operand is in internal program ROM. 
MMR The operand is a memory-mapped register. 


MMPORT _ The operand is a memory-mapped io port. 


The number of cycles required for each instruction is given in terms of the pro- 
cessor machine cycles (CLKOUT1 period). For the RPT mode execution, nin- 
dicates the number of times a given instruction is repeated by an RPT or RPTZ 
instruction. The additional wait states for program/data memory and I/O ac- 
cesses are defined below. Note that these additional cycles can be generated 
by the on-chip software wait state generator or by the external READY signal. 
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Example Instruction _ EXAMPLE 


SBESE SESSELESBSISSSS SSE GS SESE ESE SSDS ISS DES ESS IEDR SSL SIS OSS SOOO SLE SSSI S SSPE SSE 


p Program memory wait states. Represents the number of additional 
clock cycles the device waits for external program memory to respond 
to an access. 


d Data memory wait states. Represents the number of additional clock 
cycles the device waits for external data memory to respond to an ac- 
cess. 

io I/O wait states. Represents the number of additional clock cycles the 
device waits for an external I/O to respond to an access. 

n Repetitions (where n> 2 to fill the pipeline). Represents the number of 


times a repeated instruction is executed. 


The above variables can also use the subscripts src, dst, and code to indicate 
source, destination, and code, respectively. 


Note that the internal single-access memory on each 'C5x processor is divided 
into 1K- or 2K-word blocks contiguous in address space: 


Four 2K-word block 0800h—0FFFh 
1000h—17FFh 
1800h—1FFFh 
2000h—27FFh 


One 1K-word block 2800h—2BFFh 


All ’C5x processors support parallel accesses to these internal single-access 
RAM blocks. However, one single access block allows only one access per 
cycle. In other words, the processor can read/write on single-access RAM 
block while accessing another single-access RAM block at the same time. 


Note that all external reads take at least one machine cycle while all external 
writes take at least two machine cycles. However, if an external write is im- 
mediately followed or preceded by an external read cycle, then the external 
write requires three cycles. See Appendix B for details. If the on-chip wait state 
generator is used to add m (m> 0) wait states to an external access, then both 
the external reads and the external writes require m+1 cycles, assuming that 
the external READY line is driven high. In case the READY input line is used 
to add m additional cycles to an external access, then external reads require 
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a nA a aaa a ae 


Example 
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m+1 cycles, and external write accesses require m+2 cycles. See Chapter 6 
for the dicussion on software wait states and Appendix A for READY electrical 
specifications. 


The instruction cycle timings are based on the following assumptions: 


_.] Atleast the next four instructions are fetched from the same memory sec- 
tion (internal or external) that was used to fetch the current instruction (ex- 
cept in case of PC discontinuity instructions like B, CALL, etc.) 


[_.}] Inthe single execution mode, there is no pipeline conflict between the cur- 
rent instruction and the instructions immediately preceding or following 
that instruction. The only exception is the conflict between the fetch phase 
of the pipeline and the memory read/write (if any) access of the instruction 
under consideration. See Chapter 3 for pipeline operation. 


] Inthe repeat execution mode, all conflicts caused by the pipelined execu- 
tion of an instruction are considered. 


Refer to Appendix C for further information on instruction cycle classifications 
and timings. 


Example code is included for each instruction. The effect of the code on 
memory and/or registers is summarized. 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 


Cycles 


Example 1 


Example 2 


{Absolute Value of Accumulator ABS 


Mae Oat eel AEM Cela eter hota ahegtatse eateeentyaas Raye Wainy etter Tepe RK eee 


[label ABS 


None 


15 14 13 #12 11 #10 9 8 7 6 5 4 3 2 +1 = =Q 
1 O 1 1 1 1 1 O 0 0 0 0 0 0 0 0 


(PC) +1 —- PC 
((ACC)| - ACC;0—-C 


Affected by OVM; affects OV and C. 
Not affected by SXM. 


lf the contents of the accumulator are greater than or equal to zero, the accu- 
mulator is unchanged by the execution of ABS. If the contents of the accumula- 
tor are less than zero, the accumulator is replaced by its 2s-complement value. 
The carry bit (C) on the ’C5x is always reset to zero by the execution of this 
instruction. 


Note that 80000000h is a special case. When the overflow mode is not set 
(OVM = 0), the ABS of 80000000h is 80000000h. When the overflow mode is 
set (OVM = 1), the ABS of 80000000h is 7FFFFFFFh. In either case, the OV 
status bit is set. 


1 


| Cycle Timings for a Single Instruction | 


PDA = | PSA 


| Cycle Timings for a Repeat (RPT) Execution 
a ae | 


ABS 
Before Instruction After Instruction 
Acc fsa] «= aces 
C C 
ABS 


Before Instruction After Instruction 


ACC _OFFFFFFFFh acc [o] 
Cc Cc 
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Example 3 


Before Instruction 


ACC 

Cc 

OV 

Example 4 ABS 3 (OVM = 0) 
Before Instruction 
ACC 

Cc 

OV 
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After Instruction 


Acc [0] 7FFFFFFFh 
Cc 


OV 
After Instruction 
Acc [0] 80000000h 
Cc 
OV 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


[label] ADCB 


None 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 =O 


1 O 14 71 1 1 1 0 0 0 0 14 0 0 0 1 


(PC) +1 PC 
(ACC) + (ACCB) + (C) -» ACC 


Affected by OVM; affects OV and C 


The contents of the accumulator buffer (ACCB) and the value of the carry bit 
(C) are added to the accumulator. The carry bit is set to one if the result of the 
addition generates a carry from the MSB position of the accumulator. 


oma, 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 


ADCB 
Before Instruction After instruction 
acc [1] acc [0] 
C C 
ACCB ACCB 
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Syntax 


Operands 


Opcode 


Execution 
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ADD Accumulator 


Direct: 


Indirect: 


Short: | 


Long: | 


Direct: 


Indirect: 


Petar tae lt itattalt ds nati neaetatatatddtaiditedltainitidiatetatan tas inode tthicrinedrare ed Ghd riditiarad tiated ie aeoneenen Meta imitate ye Nake Satelit atatiahe diatabaiatatatditatitabatatlitata titted hitataititatdenNide ap tat ated piutateietnt tat Mokdeeennteracececetened ORES eee eee 


Direct: [label ADD dma |[,shiftT] 

Indirect: [label ADD {ind} [,shift1 [,nextARP]] 
Short Immediate: [/abe] ADD #k 

Long Immediate: [label ADD #l/k [,shiftZ 


0 < dma < 127 

O<shifti <16 (defaults to 0) 
O =< next ARP <7 

0O<ks255 

—32768 < Ik < 32767 
O<shift2<15 (defaults to 0) 


15 14 #13 #12 #11 £10 9 8 7 6 5 4 3 2 #1 #90 


0 Od 1 0 | SHFTt =| O| Data Memory Address | 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 QO 


SHFT t Ae See Subsection 4.1.2 


15 14 13 #12 #11 =#10 9 8 7 6 5 4 3 2 1 O 


Add to accumulator with shift of 16 
15 14 13 12 11 10 9 8 


oO 1 1 0 0 0 0 Data Memory Address | 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 QO 


oO 1 1 0 0 0 0 141/41 See Subsection 4.1.2 | 


t See Section 4.5. 
Direct or Indirect Addressing: 


(PC) +1 — PC . 
(ACC) + [(dma) x 2shiftt] — ACC 
Affected by SXM and OVM; affects C and OV. 


Short Immediate Addressing: 


(PC) +1 —- PC 
(ACC) +k — ACC 
Affected by OVM; affects C and OV 


Long Immediate Addressing: 


(PC) +2 — PC 
(ACC) + Ik x 2shift2 — ACC 
Affected by SXM and OVM; affects C and OV. 


Assembly Language Instructions 


Add to Accumulator ADD 


Description 


Words 


Cycles 


The contents of the addressed data memory location or an immediate con- 
stant are left-shifted and added to the accumulator. During shifting, low-order 
bits are zero-filled. High-order bits are sign-extended if SXM = 1 and zero-filled 
if SXM = 0. The result is stored in the accumulator. When short immediate ad- 
dressing is used, the addition is unaffected by SXM and is not repeatable. Note 
that when the ARP is updated during indirect addressing, a shift operand must 
be specified. If no shift is desired, a0 may be entered for this operand. 


When adding with a shift of 16, the carry bit is set if the results of the addition 
generates a Carry; otherwise, the carry bit is unaffected. This allows the accu- 
mulation to generate the proper single carry when adding a 32-bit number to 
the accumulator. 


1 (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) 


Direct: [abe] ADD dma |[,shiftT] 
Indirect: [labe ADD {ina} [, shift? [,nextARFP] 


Cycle Timings for a Single Instruction 


ENN (Ea a Se 


Cycle anott fora ole ae Execution 


Fo lh SARAM 
a 


Operand Ex 


t If the operand and the code are in the same SARAM block. 


Short Immediate: [/abej] ADD #k 


Cycle Timings for a Single Instruction 
ee hee ee 5 eee 
Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 
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Cycles 


Example 1 


Example 2 


Example 3 


Example 4 
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Long Immediate: 


[label] ADD #1k [,shift 


Cycle Timings for a Single Instruction 


ADD 


ADD 


ADD 


After Instruction 


Data Memory ee te 
301h 


acc [fo] [oan] 
C 


After Instruction 


ARP 
AR4 | -0303h 


Data Memory 


302h aa) 
acc [0] 
Cc 


After Instruction 


Not Repeatable 
DAT1,1 ;(DP = 6) 
Before Instruction 
Data Memory 
301th feat soteonaaae 
ACC 
C 
*+,0,AR0 
Before Instruction 
ARP 
AR4 
Data Memory 
302h 
ACC 
Cc 
#1h sAdd short immediate 
Before Instruction 
acc a) 
Cc 


acc [0] [0am 
C 


#1111h,1 ;Add long immediate with shift of 1 


Before Instruction 


acc 
C 


After instruction 


acc = [0] L_222ah] 
C 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 
Words 
Cycles 


Example 


[label] ADDB 


None 

15 14 13 12 11 10 9 8 7 
| 0 1 1 1 1 1 0 0 
(PC) +1 — PC 


(ACC) + (ACCB) — ACC 
Affected by OVM; affects C and OV. 


_ Add ACCB to Accumulator ADDB 


OSES SSS SLOTS IOS OOS ISIS IISA 


The contents of the accumulator buffer (ACCB) are added to the accumulator. 
1 
[labe| ADDB 


Cycle Timings for a Single Instruction 
Ca [ee 7 ae cr 


Cycle Timings for a Repeat (RPT) Execution — 


Before Instruction 


ACC 1234h 
Acca 
C 


After Instruction 

ACC 1236h 

acca [0] 
C 


4-33 


Syntax Direct: [labef ADDC dma 
Indirect: [label ADDC {ina} [,next ARP 


Operands 0<dma s 127 
0 <= next ARP <7 


Opcode 
15 14 13 12 11 #10 Q 7 6 5 4 3 2 1 ~=«0 


8 
Direct:| O 1 1 0 0 dO 0 @Q Data Memory Address 
8 


15 14 13 12 11 #=10 =@Q 6 5 4 3 2 1 0 


7 
Indirect! 9 1 #41 #0 0 0 0 0 See Subsection 4.1.2 


Execution (PC) +1 —- PC 
(ACC) + (dma) + (C) — ACC 


Affected by OVM; affects OV and C. Not affected by SXM. 


The contents of the addressed data memory location and the value of the carry 
bit are added to the accumulator with sign extension suppressed. The carry 
bit is then affected in the normal manner. 


The ADDC instruction can be used in performing multiple-precision arithmetic. 
Words 1 


Cycles Direct: [labe] ADDC dma 
Indirect: [label ADDC {ind} [,next ARP 


Cycle Timings for a Single Instruction 


Operand SARAM 
n+1T 


Operands 


t If the operand and the code are in the same SARAM block. 


Example 1 ADDC DATO ;(DP = 6) 
Before Instruction After Instruction 
Data Memory Data Memory 
300h 300h 


acc [1] acc [0] L___ 18h] 
Cc C 


4-34 7 Assembly Language Instructions 


Add ACCB to Accumulator With Carr 


ADDC 


eit ata tA Latha tn itn aC at ats at aati nti aa a id nse aaa nana an Ati lid wlan tae iat Hata tata daa i aa Pall data ai tianlian ati aa al a Wea ti aa ati aha aa a ata aC a alia aaa aaa an a AAA AA AAS Naa Aa ae ea LA AL AOL LA ELC CRA CR tat 


Ltn es Pacatgea tala alata tatstnrsnatstatate! ate! at at ans eytats Sate laestatesssacane caterer testes mately late saetat aM atens cate ta eta etna ene ttre sey een ne nena near ate nana an ese nRtE ne Re ae aes ate gate rear E ee eT een ee sateen Aen net arcane eager Ree ect ne ene nEn enone ene ened ene senate eye age ean ennee a wacee seer enea tata 


Example 2 ADDC *—,AR4 ;(OVM = 0) 
Before Instruction 
ARP el) 
ARO 
Data Memory 

300h 
aco 

C 

[x] 

OV 


After Instruction 


ARP C7 

ARO 
Data Memory 

300h 

ACC 


[9] ° EI 


4-3 


oO 


Syntax Direct: [labef ADDS dma 
Indirect: [/abefl ADDS {ina} [,next ARP 


Operands 0 <dmas 127 
0 s next ARP <7 


Opcode 
15 14 #13 #12 +111 =£:10 Q9 7 6 5 4 3 #2 1 =O 


8 
Direct} 0 1 1 0 0 O 1 Of 0) Data Memory Address 


15 14 13 12 11 10 9 8 7 6 
0 1 1 


5 4 3 2 1 0 
1 0 0 0 0/1. See Subsection 4.1.2 


Indirect: 


Execution (PC) +1 — PC 
(ACC) + (dma) — ACC 
(dma) is an unsigned16-bit number 


Affected by OVM; affects OV and C. 
Not affected by SXM. 


Description The contents of the specified data memory location are added to the accumu- 
lator with sign-extension suppressed. The data is treated as an unsigned 
16-bit number, regardless of SXM. The accumulator contents are treated as 
a signed number. Note that ADDS produces the same results as an ADD in- 
struction with SXM = 0 and a shift count of 0. 


Words 


—, 


Cycles Direct: [abe] ADDS dma 
Indirect: [label ADDS {ina} [,next ARP] 


Cycle Timings for a Single Instruction 
Operand DARAM 


Ea te 

as ce 
eee A ae 
a 


i 
Operand ee na [ea eee 


Cycle Timings for a Repeat (RPT) Execution 
I 205 ie Ee 
[OperanaDaRAM Ta fe fee 
a 


Operand SARAM 


Operand Ext 


t If the operand and the code are in the same SARAM block. 


4-36 Assembly Language Instructions 


Add to Accumulator With Sign-Extension Suppressed ADDS 


Example 1 ADDS DATO ;(DP 
Data Memory 
300h 
ACC 
Cc 
Example 2 ADDS * 
ARP 
ARO 
Data Memory 
300h 
ACC 
C 


= 6 ) 
Before Instruction 


OFOO6h 
00000003h 


Before Instruction 


After Instruction 
Data Memory 


300h OFO06h 
acc [o] 0000FO009h 
Cc 


After instruction 


ARO 
Data Memory 
300h OFFFFh 


acc. [o] 7FFFFFFFh 
C 


4-37 


Syntax Direct: [label ADDT dma 

Indirect: [label ADDT {ina} [,next ARP 
Operands 0 < dma s 127 

0 = next ARP <7 
Opcode 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #0 


Direct} O 1 #1 0 0 O 1 #1401 Data Memory Address 


15 14 13 12 11 #10 9 8 7 6 
1 1] 1 


5 4 2 1 O- 
Indirect: See Subsection 4.1.2 


Execution (PC)+1—- PC 
(ACC) + [(dma) x 27REG1(3-0)] -» (ACC) 
lf SXM = 1: 
Then (dma) is sign-extended. 
lf SXM = 0: 
Then (dma) is not sign-extended. 


Affected by SXM and OVM; affects OV and C. 


Description The data memory value is left-shifted and added to the accumulator, with the 
result replacing the accumulator contents. The left-shift is defined by the four 
LSBs of the TREG1, resulting in shift options from 0 to 15 bits. Sign extension 
on the data memory value is controlled by SXM. The carry bit is set when a 
carry is generated out of the MSB of the accumulator. 


Software compatibility with the C25 can be maintained by setting the TRM bit 
of the PMST status register to zero. This causes any ’C25 instruction that loads 
TREGO to write to all three TREGs. Subsequent calls to the ADDT instruction 
will shift the value by the TREG1 value (which is the same as TREGO), main- 
taining object-code compatibility. 


Words 1 


Cycles 


| | Cycle Timings for a Single Instruction 


PPR | PDA | PSA | 
ae eee ee Bc 
Operand SARAM 1 qT 4 1+p 
i 


| OperandExt Ext oT | 2ed4p | 
4-38 Assembly Language Instructions 


Operand DARAM 
Operand SARAM 


t If the operand and the code are in the same SARAM block. 


Example 1 ADDT DAT127 ;(DP = 4. SXM = 0) 
Before Instruction After Instruction 
Data Memory Data Memory 
027Fh 027Fh 


TREG1 OFF94h TREG1 OFF94h 
ACC OF715h Acc [0] OF7A5h 
Cc 


C 
Example 2 ADDT *—,AR4 ;(SXM = 0) 
Before Instruction After Instruction 
ARP CJ AP C4 
ARO ARO 
Data Memory Data Memory 

027Fh 027Fh 
TREG1 TREG1 
Acc acc] 

C C 


4-3 


fo) 


Syntax [label ADRK #k 


Operands 0<ks 255 


Opcode 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 1 +0 
1 1 


Short: 


Execution (PC) +1 — PC 
AR(ARP) + 8-bit positive constant —> AR(ARP) 


Description The 8-bit immediate value is added, right-justified, to the currently selected 
auxiliary register (as specified by the current ARP) with the result replacing the 
auxiliary register contents. The addition takes place in the ARAU, with the im- 
mediate value treated as an 8-bit positive integer. Note that all arithmetic oper- 
ations on the auxiliary registers are unsigned. 


Words 1 


Cycles [label] ADRK #k 
Cycle Timings for a Single Instruction 


a 5 ee 


| Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 
Example ADRK #80h 
Before instruction After Instruction 
ARP ARP eae 
ARS = 4321h ARS | 43A1h 


4-40 Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Direct: [label AND dma 
Indirect: [labef| AND {ind} [,next ARP 
Long Immediate: [label AND #l/k [,shiffj 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 #1 =O 


Direct:|} O 1 1 Oo 1 1 1 0 Data Memory Address | 


1514 13 12 11:10 9 8 7 6 5 4 3 2 1 +O 
Indirect} 0 1 1 O 1 1 #1 «0 See Subsection 4.1.2 
7 


15 _ 14 #13 #12 #11 #10 #9 8 6 5 4 3 2 1 0 


ese NO es a 
ong: 16-Bit Constant 


AND with ACC long immediate with shift of 16 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 ji 0 
1 0O 1 1 1 #14 41 0 1 £0 0 0 0 0 0 1 


Long: 
16-Bit Constant | 


t See Section 4.5. 


Direct or Indirect Addressing: 

(PC) +1—-PC 

(ACC(15—0)) AND (dma) — ACC(15—0) 
0— ACC(31—16) 


( | 
(ACC(30—-0)) AND Ik x 2shift — ACC 
Not affected by SXM 


If direct or indirect addressing is used, the low word of the accumulator is 
ANDed with a data memory value, and the result is placed in the low word posi- 
tion in the accumulator. The high word of the accumulator is zeroed. If immedi- 
ate addressing is used, the long immediate constant is shifted, and the low- 
order bits below and high-order bits above the shifted value are zeroed. The 
resulting value is ANDed with the accumulator contents. 


ow, 


(Direct or indirect addressing) 


NO 


(Long immediate addressing) 


—" 


4-4 


Cycles 


Example 1 


Example 2 


Example 3 


4-42 


Direct: [label AND dma 
Indirect: [label AND {ina} [,next ARP 


PE 
A 


ot 


Operand Et 
Cycle Timings for a Repeat (RPT) Execution 


Operand SARAM 


Operand Ext 


t If the operand and the code are in the same SARAM block. 
Long Immediate: [labe] AND #/k [,shiffj 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


AND DAT16 ;(DP = 4) 
Before Instruction 


After Instruction 


Data Memory Data Memory 
0210h OOFFh 0210h _ OOFFh 
ACC 12345678h ACC 00000078h 
AND * 


Before Instruction After Instruction 


ARP C..... 9g ARP eT) 

ARO ARO 
Data Memory Data Memory 

0301h 0301h 

ACC ACC 


AND #OO0OFFh,4 
Before Instruction 


ACC 12345678h ACC 


After Instruction 


00000670h 


Assembly Language Instructions 


AND ACCB With Accumulator AND B 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


[label ANDB 


None 


15. 14 #13 :#12 ~#11:#10 9 8 7 6 5 4 3 2 +1 i020 
1 O 1 1 1 #71 +71 +0 0 0 0 1 0 0 1 =O 


(PC) +1 —~ PC 
(ACC) AND (ACCB) — ACC 


The contents of the accumulator are ANDed with the contents of the accumula- 
tor buffer (ACCB). The result is placed in the accumulator while the accumula- 
tor buffer is unaffected. 


1 


[label] ANDB 


ANDB 
Before Instruction After Instruction 
ACC OFOFFFFFh ACC | -05055555h 


ACCB ACCB 


4-43 


APAC Aad P Register to Accumulator 


Sap aa ny Se OR SHAR aE TE EOP eae nee PaCS POP NSE ce MaBGE TO east ch OM AE SEONG RSE NG?CMa SRDS SESENGRGH HGR Ha RSB DBS SB RNO aR ET Shae SE CaS eee ENE ERNE eee Ne aNE iE SSS ae EE aM ueP NSB ct NSERC aN ES ERE SN NNER NOSE Raa SME? a 


Hap aaa Eat ea ata sana ii a ata a Sanna ana aa aaa en aaa al nia li ina a aa na ai a nin eas ssa aan li anni Sata ant ns a an stan natn rata n'y tian ona ata as ina Maia saa ata analy iat 


Syntax [label] APAC 
Operands None 
Opcode 


15 14 +#13 ~#12 ~#«11:°#10 9 8 7 6 5 4 3 2 1 QO 
1 O 1 147 4 4 4 0 Od O0O O08 O0 O0 1 0 0 


Execution (PC) +1 — PC 
(ACC) + (shifted P register) — ACC 


Affected by PM and OVM; affects OV and C. 
Not affected by SXM. 


Description The contents of the P register are shifted as defined by the PM status bits and 
added to the contents of the accumulator. The result is placed in the accumula- 
tor. APAC is not affected by the SXM bit of the status register; the P register 
is always sign-extended. The APAC instruction is a subset of the LTA, LTD, 
MAC, MACD, MADS, MADD, MPYA, and SQRA instructions. 


Words 1 
Cycles [label APAC 


a 
a 


Cycle Timings for a Repeat (RPT) Execution 
a ee es ee 


Example APAC ;(PM = 01) 
Before Instruction After Instruction 
P p 


ACC acc fo} [Aan 
Cc C 


4-44 Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Cycles 


Direct: [label APL [#/k,] dma 
Indirect: [labef APL [#/k,] {ina} [,next ARP 


0 <dma < 127 
Ik: 16-bit constant 
0 < next ARP <7 


6 5 4 3 2 1 0 
Data Memory Address 


6 5 4 3 2 +1 =O 


7 
= 

7 : 
To 


15 14 13 12 11 10 9 
Direct:} 0 1 0 14 #4 +O 1 


8 
0 
15 14 13 12 11 #10 9 8 
0 
8 


Indirect:| O 1 0 1 1 0 1 


15 14 13 #12 #111 #10 Q 6 5 4 3 2 1 0 


Direct: 16-Bit Constant 
15 14°13 12 11°10 9 8 7 6 5 4 3 2 #1 +O 


indie 0 1 Oo 1 1 1 1 0 See Subsection 4.1.2 


16-Bit Constant 


lk unspecified: 
(PC) +1 — PC 
(dma) AND (DBMR) — dma 


Ik specified: 

(PC) +2 — PC 

(dma) AND Ik — dma 
Affects TC. 


If along immediate constant is specified, it is ANDed with the data memory val- 
ue dma. Otherwise, the data memory value is ANDed with the contents of the 
dynamic bit manipulation register (OBMR). In either case, the result is written 
directly back to the data memory location, while the contents of the accumula- 
tor are unaffected. If the result of the AND operation is 0, then the TC bit is set 
to 1. Otherwise, the TC bit is set to 0. 


1 (Second operand DBMR) 
2 (Second operand long immediate) 


Direct: [labe] APL [#/k,] dma 
Indirect: [label APL [#/k,] {ina} [,next ARP| 


Cycle Timings for a Single Instruction | 


APL AND Data Memory Value With DBMR or Long 


Example 1 


Example 2 


Example 3 


4-46 


Direct: 
Indirect: 


Operand DARAM 


Operand DARAM 
Operand SARAM 


Operand Ext - 


Cycle Timings for a Repeat (RPT) Execution 


[label] APL [#/k,] dma 


4n-1+2nd | 4n—1+2nd 


[label] APL [#Ik] {ind} [next ARP] 


[Pa [POA [PSA FE 
[Operand ARAM [2 fe pe *i ee 
See Eee 


Cycle Timings for a Repeat (RPT) Execution | 


ae 


2n-1 2n—1+2p 


t If the operand and the code reside in same SARAM block. 


APL 


#0023h,DAT96 ;(DP = 0) 
Before Instruction 
Data Memory 


60h 
TC 


DAT96 ;(DP = 0) 


Before Instruction 


DBMR OFFOOh 


Data Memory 


60h 1141h 
TC 


#0100h,*,AR6 


Before Instruction 

ARP CF 
TC 

ARS 


Data Memory | 


300h OFFFh 


After Instruction 
Data Memory 


60h 4] 
TC 


After instruction 
DBMR | OFFOOh 


Data Memory 


60h = [0] 1100h 
TC 


After instruction 


arp] CS 
TC 
ARS 


Data Memory 


300h | 0100h 


Assembly Language Instructions 


Se eee AND Data Memory Value With DBMR or Long Constant APL 


AAAI MO ORION INO SOON I OA A IS 


Example 4 APL *, ART 


Before instruction After Instruction 


ARP ae aRP [0] 
TC TC 


AR6 AR6 310h 

DBMR 0303h DBMR 0303h 
Data Memory Data Memory 

310h OEFFh 310h 0203h 


4-47 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 


4-48 


B Branch Unconditionall 


[label B[D| pma |, {ind} [,next ARP] 


0 s pma s 65535 
0 =< next ARP s 7 


Branch unconditional with AR update 
15 14 13 12 11 10 Q 8 


Branch unconditional delayed with AR update 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 #1 Q 


o111 141 4 0 1441/ See Subsection 4.1.2 
16-Bit Constant 


pma — PC 
Modify AR(ARP) and ARP as specified. 


The current auxiliary register and ARP are modified as specified, and control 
is passed to the designated program memory address (pma). Pma can be ei- 
ther a symbolic or numeric address. The one two-word instruction or two 
one-word instructions following the branch instruction are fetched from pro- 
gram memory and executed before the branch is taken, if the branch is a 
delayed branch (specified by the D suffix). 


2 
[labef B[D| pma |, {ind} [,next ARP] 


‘Cycle Timings for a Single Instruction | 
4 | 4 00 dT 4A P4e4pt - 
~ Cycle Timings for a Repeat (RPT) Execution | 
| Not Repeatable 


t The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


B 191,*+,AR1 


The value 191 is loaded into the program counter, and the program continues 
executing from that location. The current auxiliary register is incremented by 
1, and ARP is set to auxiliary register 1. 


BD 191 
MAR *+,AR1 
LDP #5 


After the current AR, ARP, and DP are modified as specified, program execu- 
tion continues from location 191. 


Assembly Language Instructions 


Branch to Location Specified by Accumulator BACC 


ORAL ORR LE LEE OEE Ot eee ae iil Cetin iat di ati LA ee i OE eed elie ii iti de Oe te CE KOO Oh el Ot Ot ait 


ESE SSL S SESS P SO ROOT POO OO TO RO POO ISSO SOOOCOOOL OSES SESS OLE SSES SSS OS SECESSION ALITELIES LEE SSSSS SELES SIESTA SIL 


Syntax [labe} BACC[D) 

Operands None 

Opcode 
BACC 
15 14 13 12 11 10 9 8 7 6 § 4 3 #2 1 0 
1 O 1 717 1 1 = +1 +0 0 0 1 #90 +O 0 0 
BACCD 


15 14 #13 :#12 #11:°#10 9 8 7 6 5 4 3 2 $1 O 
1 OO 1 14 1 1 ~=+141 +0 0 0 1 0 0 0 0 1 


Execution ACC(15—0) — PC 


Description Control is passed to the 16-bit address residing in the lower half of the accumu- 
lator. The one two-word instruction or two one-word instructions following the 
branch instruction are fetched from program memory and executed before the 
branch is taken, if the branch is a delayed branch (specified by the D suffix). 


Words 1 
Cycles BACC 


| Cycle Timings for a Single Instruction 


PPR | POA | PSA PE 
a Eee Ee 
| Cycle Timings for a Repeat (RPT) Execution 


: | Not Repeatable 


t The ’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


BACCD (delayed) 


Cycle Timings for a Single Instruction 


Ceca 5) ey 
a ee 5 a 


Cycle Timings for a Repeat (RPT) Execution 


. Not Repeatable 


Example 1 BACC ;(ACC contains the value 191) 


The value 191 is loaded into the program counter, and the program continues 
executing from that location. 


Example 2 BACCD ;(ACC contains the value 191) 
MAR *+,AR1 
LDP #5 


After the current AR, ARP, and DP are modified as specified, program execu- 
tion continues from location 191. 


4-49 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


4-50 


[label] BANZ[D| pma [, {ina} [,next ARP] 


0 s pma s 65535 
Os next ARP <7 


BANZ 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Oo 4 1 14 4 0 141 «4714 See Subsection 4.1.2 
16-Bit Constant 


BANZD 


15.14 #13 #12 #11 :#10 9 8 7 6 5 4 


3 2 1 «0 
O14 1 14 4 4 «14/47 See Subsection 4.1.2 
16-Bit Constant 


lf AR(ARP)=0 
Then pma — PC 
Else (PC) +2 — PC 
Modify AR(ARP) as specified 


Control is passed to the designated program memory address (pma) if the 
contents of the current auxiliary register are not zero. Otherwise, control 
passes to the next instruction. The default modification to AR(ARP) is a decre- 
ment by one. N loop iterations may be executed by initializing an auxiliary reg- 
ister loop counter to N—1 prior to loop entry. The program memory address 
(pma) can be either a symbolic or a numeric address. 


The two one-word instructions or one two-word instruction following the 
branch instruction are fetched from program memory and executed before the 
branch is taken, if the branch is a delayed branch (specified by the D suffix). 


2 
[label BANZ pma [, {ina} [,next ARP] 


Cycle Timings for a Single Instruction 


0 0 
Gondiionstwe A 
[CondtonFase 


Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


t The ’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


Assembly Language Instructions 


Example 1 


Example 2 


Example 3 


eae eeeeeeeaes mmmmmnanch on Auxiliary Register Not Zero BANZ 


[label BANZD pma [, {ina} [,next ARP] 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 
BANZ PGMO 
Before Instruction After instruction 
ARP [ ARP [i 
ARO ARO 


0 is loaded into the program counter, and the program continues executing 
from that location. 


or 
Before Instruction After Instruction 
ARP a) ARP eras) 
ARO ARO = OFFFFh 


The program counter (PC) is incremented by 2, and execution continues from 
that location. 


BANZD PGMO 
LACC #0lh 
LDP #5 
Before Instruction After Instruction 
ARP [ ARP [aa 
ARO ARO 
DP Cd DP Ls) 
ACC ACC 


After the current DP and ACC are modified as specified, program execution 
continues from location 0. 


MAR *,ARO 
LAR AR1,#3 
LAR ARO,#60h 


PGM191 ADD *+,AR1 


BANZ PGM191,AR0 


The contents of data memory locations 60h-63h are added to the accumula- 
tor. 


4-51 


Syntax. 


Operands 


Opcode 


Execution 


Description 


4-52 


[label] BCND[D| pma, [cond7] [,condZ [,...] 


0 < pma s 65535 

Conditions: ACC=0 EQ 
ACC#0 NEQ 
ACC<0 LT 
ACCs0 LEQ 
ACC>0 GT 
ACC20 GEQ 
C=0 NC 
C=1 C 
OvV=0 NOV 
Ov=1 OV 
BIO low BIO 
TC=0 NTC 
TC=1 TC 
Unconditionally UNC 

BCND 


15 14 13 12 1110 9 8 7 6 5 4 3 2 14 +0 
ZLVC t ZLvC t 
16-Bit Constant | 


BCNDD 


15 14 13 i : 10 9 
141 1 0 ee 4 
16-Bit Constant 


t See Section 4.5. 


If (condition(s)) 
Then pma — PC 
Else PC +2 — PC 


A branch is taken to program memory address pma if the specified conditions 
are met. Note that not all combinations of conditions are meaningful. Also, note 
that testing BIO is mutually exclusive to testing TC. 


The two one-word instructions or one two-word instruction following the 
branch are fetched from program memory and executed before the branch is 
taken, if the branch is a delayed branch (specified by the D suffix). If the 
delayed instruction is specified, the two instruction words following the 
BCNDD instruction have no effect on the conditions being tested. 


Assembly Language Instructions 


Words 
Cycles 


Example 1 


Example 2 


2 
[label] BCND pma, [cond7] [,conaZ [,...] 


Cycle Timings for a Single Instruction 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


t The ’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


[label BCNDD pma, [cond7] [,condZ [,...] 


Cycle Timings for a Single Instruction 


s =S 
+ 
: SS 
n°) 
— 


Condition False | 2 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


BCND PGM191,LEQ,C 


lf the accumulator contents are less than or equal to zero and the carry bit is 
set, program address 191 is loaded into the program counter, and the program 
continues executing from that location. If these conditions do not hold, execu- 
tion continues from location PC + 2. 


BCNDD PGM191,0V 
MAR * ARI 
LDP = #5 


After the current AR, ARP, and DP are modified as specified, program execu- 
tion continues at location 191 if the overflow flag (OV) in status register STO 
is set. If the flag is not set, execution continues at the instruction following the 
LDP instruction. 


4-53 


BIT Test Bit 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


4-54 


Direct: [label BIT dma, bit code 


Indirect: [label BIT {ina} , bit code [,next ARP| 


0 <dmas 127 

Os next ARP <7 

0 < bit code <15 

15 14 13 12 11 10 9 8 
Direct [01 00 


15 14 13 12 11:10 9 


Indirect:| 0 1 #O 0] BITX t See Subsection 4.1.2 | 


t See Section 4.5. 


(PC) +1 — PC 


8 


6 5 4 3 2 +1 #0 


7 
| 0 | | DataMemoryAddress | 


7 6 5 4 3 2 +1 0 


(dma bit at bit address (15 — bit code)) — TC 


Affects TC. 


The BIT instruction copies the specified bit of the data memory value to the TC 
bit of status register ST1. Note that the BITT, CMPR, LST1, APL, CPL, OPL, 
XPL, and NORM instructions also affect the TC bit in status register ST1. A bit 
code value is specified that corresponds to a certain bit address in the instruc- 
tion, as given by the pause table: 


i ae 
a 


Assembly Language Instructions 


Test Bit BIT 


Cycles Direct: [labe] BIT dma, bit code 
Indirect: [label BIT {ina} , bit code [,next ARP 


PE 
a 


Operand ER 
Cycle Timings for a Repeat (RPT) Execution 


ae" 


Ronin SARAM 
a 


| OperandExt Ext | onend | | n+nd | | on+nd | n+i+ptnd 


t If the operand and the code are in the same SARAM block. 


Example 1 BIT Oh,15 ;(DP = 6).Test LSB at 300h 
Before Instruction After Instruction 
Data Memory Data Memory 
300h 300h 
TC n:) TC eee) 
Example 2 BIT *,0,AR1 ;Test MSB at 310h 
Before Instruction After Instruction 
ARP Ld ARP ee el 
ARO ARO 
Data Memory Data Memory 
310h 310h 
TC re TC etn esl 
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BITT Test Bit Specified by TREG2 


ata lain taal ata atau aS a tala ut Sisal an anita ath aaa Sia aul Isai alain dai aaa nial a aia assault aia ea SaSatal tata ad aa Satala atalino SaDa halal sa aie a aaa aaa tadashi SeSutah iain ial tis stalalatatataiaiabslata' Satna asada siatatatatatat ai ahatatat i alaliatalataSai iit huertatalat tesidseidisi 


Syntax Direct: [label BITT dma 

Indirect: [label BITT {ina} [,next ARP 
Operands 0 <dma s 127 

0 < next ARP <7 
Opcode 


15 14 13 12 11 10 9 8 7 6 § 4 3 2 1 0 
Direct:} O 1 #1 O 14 14 #14 #1407 Data Memory Address | 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 
1 


Indirect:| 0 1 #1 #O 1 1 1] 14 See Subsection 4.1.2 


Execution (PC) +1 —- PC 
(dma bit at bit address (15 -TREG2(3—0))) — TC 
Affects TC. 
Description The BITT instruction copies the specified bit of the data memory value to the 


TC bit of status register ST1. Note that the BITT, CMPR, LST1, CPL, OPL, 
APL, XPL, and NORM instructions also affect the TC bit in status register ST1. 
The bit address is specified by a bit code value contained in the 4 LSBs of the 
TREG2, as given by the table below. 


Software compatibility with the "C25 can be maintained by setting the TRM bit 
of the PMST status register to zero. This causes any ’C25 instructions that load 
TREGO to write to all three TREGs. Subsequent calls to the BITT instruction 
will use TREG1 value (which is the same as TREGO), maintaining ’C25 object- 
code compatibility. 


ae 

; 
Ee 
CE 
a 


a 


ee 
WSBT ores 
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Example 1 


Example 2 


SSSSOSSESSESESSISOS ESOS SELLS ILS SSSA LILES SEALE S EISELE 


Direct: [label] BITT dma 
Indirect: [/abef BITT {ind} [,next ARP 


0 

[OperansDaRAM Tt 
1 1 1 1+p 

So fe 

a 


Operand SARAM 


Cycle Timings for a Repeat (RPT) Execution 


Operand SARAM n n+p 
n+1it 
Operand Ex 


t If the operand and the code are in the same SARAM block. 


BITT OOh ;(DP = 6). Test bit 14 of data at 300h 
Before Instruction After instruction 
Data Memory Data Memory 
300h 300h 
TREG2 TREG2 
TC Lo, | 


BITT * ;:Test bit 1 of data at 310h 
Before Instruction 


ARP ae ee ARP 
AR1 ARI 

Data Memory Data Memory 
310h 310h 
TREG2 TREG2 
Te are) Te ae) 


4-5 


N 


Syntax General syntax: [label] BLDD src, dst 


Ail valid cases have the general syntax: 


_ Direct K/DMA: [abe BLDD #adar, dma 
Indirect K/DMA: [label BLDD #addr, {ina} [,next ARP 
Direct DMA/K: [abe] BLDD dma, #adadr 
Indirect DMA/K: [abe BLDD {ind}, #addr [,next ARP 


Direct BMAR/DMA: [abe] BLDD BMAR, dma 
Indirect BMAR/DMA: _ [/abe] BLDD BMAR, {ina} [,next ARP) 
Direct DMA/BMAR: [abe] BLDD dma, BMAR 
Indirect DMA/BMAR: _ [label] BLDD {ina}, BMAR [,next ARP) 


Operands 0 s addr s 65535 
0 <dma s 127 
O s next ARP s 7 
Opcode 


15 14 13 #12 11 +#=10 JQ 7 6 5 4 3 2 1 #9 


8 
Direct WLIO 1 0 1 0 O 0 Data Memory Address 
16-Bit Constant 


15.14 #13 °+#12 #11 :°#10 9 8 7 6 5 4 3 2 1 9 


1 0 1 Oo 1 0 oO O See Subsection 4.1.2 


Indirect: 
| 16-Bit Constant 


Block move data to data DEST long immediate 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 1 0 


Direct: 1 O 1 O 1 Oo Oo 1 Data Memory Address 
oot 16-Bit Constant 


15_ (14 :+#13:~#«12 ~#«+11':#10 9 8 7 6 5 4 3 2 1 0 
1 


Lae See Subsection 4.1.2 


16-Bit Constant 


Indirect: 


Block move data to data with SRC in BMAR 
15 14 13 12 11 10 9 


7 6 5 4 3 2 1 =O 


8 | 
Direct:} 1 O 1 0 1 1 O O Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 
0 


Indirect} 1 0 14 #O 141 #1 #0 ee See Subsection 4.1.2 


Block move data to data with DEST in BMAR 
15 14 13 12 11109 8 7 6 5 4 3 2 1 0 


Direct:; 1 0 1 0 1 1 oO 1 | Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 +1 #0 


Indirect/ 1 0 1 0 14 #14 0 14/1/41 See Subsection 4.1.2 
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Execution 


Description 


Words 


(PFC) -» MCS 


If long immediate: 
(PC) +2 — PC 
#iIk — PFC 

Else: 

(PC) +1 — PC 
(BMAR) — PFC 


While (repeat counter) = 0: 
(src, addressed by PFC) — dst or src — (dst, addressed by PFC) 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 — PFC 
(repeat counter) —1 — repeat counter. 
(src, addressed by PFC) —-dst or src — (dst, addressed by PFC) 
Modify AR(ARP) and ARP as specified. 
(MCS) — PFC 


The word in data memory pointed at by src is copied to a data memory space 
pointed at by dst. The word of the source and/or destination space can be 
pointed at with along immediate value, with the contents of the BMAR register, 
or by adata memory address. Note that not all src/dst combinations of pointer 
types are valid. 


RPT can be used with the BLDD instruction in indirect addressing mode to 
move consecutive words in data memory. The number of words to be moved 
is one greater than the number contained in the repeat counter RPTC at the 
beginning of the instruction. The source or destination address for the BLDD 
instruction specified by the long immediate address or BMAR register contents 
are automatically incremented in repeat mode. If a direct memory address is 
specified, its address is not automatically incremented in repeat mode. Note 
that the source and destination blocks do not have to be entirely on-chip or 
off-chip. Interrupts are inhibited during a BLDD operation used with the RPT 
instruction. When used with RPT, BLDD becomes a single-cycle instruction 
once the RPT pipeline is started. 


1 (One source or destination is specified by the BMAR register) 


2 (One source or destination is specified by a long immediate value) 
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BLDD Block Move From Data Memory to Data Memor 


Cycles Direct K/DMA: [label BLDD #addr, dma 
Indirect K/DMA: [label] BLDD #addr, {ind} [,next ARP| 
Direct DMA/K: [label] BLDD dma, #addr 
Indirect DMA/K: [label] BLDD {ind}, #adar [,next ARP| 


Cycle sno. fora Single Instruction 


Source DARAM 
Destination DARAM 


Source DARAM 
Destination DARAM 
Source SARAM 

Destination DARAM - 


Source Ext 2+gro | 2+dsrc 2+dor¢ 2+dgro+P 
Destination DARAM 

Source DARAM 

Destination SARAM 

Source SARAM 

Destination SARAM 

Source Ext 2+dore 2+dere — 2+doro+P 
Destination SARAM 3+derct 

Source DARAM 3+dust 34+dys¢ 34+dysr 5+dysitP 
Destination Ext 

Destination Ext 

Source Ext 34+doro+ dct | 34+dero+ddst 3+dsro+Cust 5+dero+asr+P- 
Destination Ext 


Cycle Timings for a Repeat (RPT) Instruction 


Source DARAM - n+1 +p 
Destination DARAM 
Source SARAM n+1+p 
Destination DARAM 
Source Ext N+1+Ndgr¢o n+1+Ndgr¢o N+1+Ndgr¢ n+1+ndg-¢+p 
Destination DARAM 
Source DARAM n+1 n+1+p 
Destination SARAM n+3t 
Source SARAM n+1+p 
Destination SARAM | ig ¢ 3 2n-1+p* 
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~~ Cycle Timings for a Repeat (RPT) Instruction (Continued) 


| Source Ext n+1+ndgot n+1+ndgro n+1+Nderc n+1+nde-o+p 
Destination SARAM n+3+ndgot 


Source DARAM 2n+1+Ndgst 2n+1+Ndyez 2n+1+ndysz 2n+1+Ndgs¢t+p 
Destination Ext 


| Source SARAM | 2n+1+Ndgey 2n+1+Nndysy 2n+1+ndyst 2nt+1+ndgs+P 
Destination Ext 
Source Ext 4n—1+ndeotn | 4n—1+ndgotn | 4n—t+ndgotn | 4n+t+ndectndg+P 
Destination Ext dast dast dast | | 


t If the destination operand and the code are in the same SARAM block. 
+ if both the source and the destination operands are in the same SARAM biock. 
$ If both operands and the code are in the same SARAM block. 


Direct BMAR/DMA: [label BLDD BMAR, dma 
Indirect BMAR/DMA: [label] BLDD BMAR, {ing} [,next ARP 


Direct DMA/BMAR: [label BLDD dma, BMAR 
Indirect DMA/BMAR: _ [/labe/] BLDD {inc}, BMAR [,next ARP 


| Cycle Timings for a Single instruction 


| Source DARAM | dg 61 3 3+2p 
Destination DARAM 
Source SARAM 3 3 3+2p | 
Destination DARAM ! 
Source Ext 3+dsro | 3+dor¢ | | 3+ds,¢ | 3+dg-o+2p 
Destination DARAM | | | 
Source DARAM __—i|: 3 | 3 3+2p 
| Destination SARAM | | 4t 
| Source SARAM ; a ie 13 | | 3+2p 
| Destination SARAM 4t | 
| Source Ext - 34+dsr¢ 3+doro | 3+dgro a 3+dspco+2p | 
| Destination SARAM 4+d.,¢1 | 
Source DARAM 4+dgsy 4+dast | 44+dyc¢ 6+dys;+2p 
Destination Ext : 


Source SARAM 4+dyss | A+dyst 4+dyst 6+dys¢+2P 
Destination Ext 


Source Ext 4tderot+ dass 4+docot+dyst 44+deo+dyst 6+d6--+dgs¢+ 2p 
Destination Ext 


Cycle Timings for a Repeat (RPT) Execution 


| Source DARAM n+2 n+2 n+2 n+2+2p 
Destination DARAM | 
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BLDD Block Move From Data Memory to Data Memor 


Cycle Timings for a Repeat (RPT) Execution (Continued) 


Source SARAM n+2 n+2 n+2 N+2+2p 
Destination DARAM 
+2 


Source Ext N+2+Nndero 

Destination DARAM 

Source DARAM n n+2 

Destination SARAM 

Source SARAM n+2 n+2 
Destination SARAM | 2n? 


Source SARAM 
Destination Ext 


Destination Ext 


t If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 
$ If both operands and the code are in the same SARAM block. 


321h 


Example 1 BLDD #300h,20h ;(DP = 6) 
Before Instruction After Instruction 
Data Memory Data Memory 
300h 300h 
320h 320h 
Example 2 BLDD *+,#321h,AR3 
Before Instruction After Instruction 
ARP C8 ARP a) 
AR2 AR2 
Data Memory Data Memory 
301h 301h 


32h [oth] 
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Example 3 BLDD 
Example 4 BLDD 
Example 5 RPTK 

BLDD 


BMAR, * 


ARP 
BMAR 
AR2 


Data Memory 
320h 


340h 


00h,BMAR 


Data Memory 
300h 


BMAR 


Data Memory 
320h 


2 
#300h, *+ 


ARP 
ARO 
300h 
301h 
302h 
320h 
321h 
322h 


Before Instruction 


Oth 
OFh 


ul 


;(DP = 6) 
Before Instruction 


OF 
320 


01 


Before Instruction 


7F98 
OFFE6 


9522 


iW 


9315 


ARP 
BMAR 
AR2 


Data Memory 
320h 


340h 


Data Memory 
300h 


BMAR 


Data Memory 
320h 


ARP 
ARO 
300h 
301h 
302h 
320h 
321h 
322h 


oO 
—_ 
Ss 


After Instruction 
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Syntax Direct: [label] BLDP dma 
Indirect: [label BLDP {ina} [,next ARP| 


Operands 0 <dmas 127 
0 <s next ARP <7 


Opcode 
15 14 13 12 11 #10 9Q 6 5 4 3 2 1 #O 


8 7 
Direct} 0 1 O 1 0 1 #14 #1/]0— Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 
1 1 1 


Indirect:[0 7 0 1 0 [i] SeeSubsectiona72 
Execution (PC) +1 —- PC 
(PFC) — MCS 


(BMAR) — PFC 


While (repeat counter) = 0: 
dma -—> (dst, addressed by PFC) 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 PFC 
(repeat counter) —1 — repeat counter. 
dma — (dst, addressed by PFC) 
Modify AR(ARP) and ARP as specified. 
(MCS) — PFC 


Description A word in data memory is copied to a word in program memory space pointed 
at by the BMAR register. The RPT instruction used with the BLDP instruction 
can move consecutive words pointed at indirectly in data memory to a contigu- 
ous program memory space pointed at by the BMAR register. The BMAR reg- 
ister is automatically updated in the repeat mode. Note that the source and 
destination blocks do not have to be entirely on-chip or off-chip. When used 
with RPT, BLDP becomes a single-cycle instruction once the RPT pipeline is 
started. Interrupts are inhibited during a BLDP operation used with RPT. 


Words 1 


Cycles Direct: [abe] BLDP dma 
Indirect: [label] BLDP {ina} [,next ARP| 


Cycle Timings for a Single Instruction 


Source DARAM 
Destination DARAM 


Source SARAM 
Destination DARAM : 
Destination DARAM 

Source DARAM [2 2+p 

Destination SARAM 3t 
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Cycle Timings for a Single Instruction (Continued) 


Nn 

+ 
£ 
q) 


| 


Source SARAM 
Destination SARAM 


2+p 
gt ors 


2+dosr¢ 3+dsrc¢+Pcode 


Destination Ext 4+Past" 


Source Ext 34+ dsr¢+Past 34dsro+Past 34+dsr¢+Past 9+ dsrc+Pdst+Pcode 
Destination Ext 


Cycle Timings for a Repeat (RPT) Execution 


n+ 


- 
mo 


Source Ext 
Destination SARAM 


Source DARAM 
Destination Ext 


2+dorco 
3+Pagst | 


3+Past 


a | 
+ 
_— 


Source DARAM n+1 
Destination DARAM 


1 

Destination DARAM n+2! 

Source Ext n+1+Nderc n+1+ndgrc n+2+Nndsro+Poode 
1 


N+1+Dcode 


Destination DARAM 


Source DARAM n+1 n+1 N+1+Pcode 
Destination SARAM n+2t 


Source SARAM n+1 n+1 n+ N+1+DPcode 
Destination SARAM | 2n—1# | 2n-1t 2n-1# 2n—1+Pcode? 


n+2t or’ 
Source Ext n+1+Ndor¢ 
Destination SARAM 


2n+18 
n+1 +ndorco n+1 +Ndogr¢ N+24Ndgr¢+Pcode 
N+2+nDPerct 
Source DARAM | 2n+14+Npgsy 
Destination Ext 
Source SARAM 2n+1+NPgst 


2N+1+4+NPgst 2N+1+NPgst 2N+2+NPagst+Pcode 
Destination Ext 2n+2+nPact" 
Destination Ext NPdst NPast NPast code 
t If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 


$ If both operands and the code are in the same SARAM block. 
4 If the source operand and the code are in the same SARAM block. 


=) 
+ 
ook 


4-65 


BLDP Block Move From Data Memory to Program Memor 
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Example 1 BLDP 00h ; (DP=6 ) 
Before Instruction 
Data Memory Data Memory 
300h OAO89h 300h 
BMAR —_ 2800h BMAR 
Program Memory Program Memory 
2800h 1234h 2800h 
Example 2 BLDP *,ARO 
Before Instruction 
ARP ARP 
AR7 310h AR7 
Data Memory Data Memory 
310h OFOFOh 310h 
BMAR 2800h BMAR 
Program Memory Program Memory 
2800h 1234h 2800h 
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After Instruction 


| 0A089 
2800 


| 


| 


OA089 


After Instruction 


| 


310 


J 


___OFOFO 
____ 2800 


OFOFO 


| 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


General syntax: [labe} BLPD ge, ast 


All valid cases have the general syntax: 

Direct K/DMA: [label BLPD #pma, dma 

Indirect K/DMA: [labe] BLPD #pma, {ina} [,next ARP 
Direct BMAR/DMA: [label] BLPD BMAR, dma 

Indirect BMAR/DMA: _ [labef] BLPD BMAR, {ing} [,next ARP 


15 14 #13 ~#«12 ~*+11:+:10 9 8 7 6 5 4 3 2 ji 0 


Siape bok 0.1, 00... 1. 20) tO Data Memory Address 
16-Bit Constant 


15 14°13 #12 #11':+#10 9 8 7 6 5 4 3 2 ji 0 


1 0 | 0 6«(O { 0 1 | See Subsection 4.1.2 
Indirect: 
16-Bit Constant 


Block move prog to data with source in BMAR 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 QO 


Direct} 1 0 1 0 0 14 #0 0 Data Memory Address 


15 14 13 #12 #%111 #10 Q 7 6 5 4 3 +2 +1 =O 


8 
Indirect} 1 0 1 0 0 1 +0 O See Subsection 4.1.2 


Else: 
(PC) +1 — PC 
(PFC) — MCS 
(BMAR) — PFC 


While (repeat counter) = 0: 
(pma, addressed by PFC) — dst 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 PFC 
(repeat counter) —1 —> repeat counter. 


(pma, addressed by PFC) -> dst, 
Modify AR(ARP) and ARP as specified. 
(MCS) — PFC 


A word in program memory pointed at by the src is copied to data memory 
space pointed at by dst. The first word of the source space can be pointed at 
with a long immediate value or the contents of the BMAR register. The data 
memory destination space is always pointed at by a data memory address or 
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auxiliary register pointer. Note that not all src/dst combinations of pointer types 
are valid. 


RPT can be used with the BLPD instruction if more than one word is to be 
moved. The number of words to be moved is one greater than the number con- 
tained in the repeat counter, RPTC, at the beginning of the instruction. The 
source address specified by the long immediate or BMAR value is automati- 
cally incemented in repeat mode. Note that the source and destination blocks 
do not have to be entirely on-chip or off-chip. Interrupts are inhibited during 
a repeated BLPD instruction. When used with RPT, BLPD becomes a single- 
cycle instruction once the RPT pipeline is started. 


Words 1 (Source is specified by the BMAR register) 
2 (Source is specified by a long immediate) 


Cycles Direct K/DMA: [label BLPD #pma, dma 
Indirect K/DMA: [label] BLPD #pma, {ina} [,next ARP 


| | Cycle Timings fora Single Instruction | _ | 


Source DARAM/ROM | 2. 2 he 2 | 2+Pcode 
Destination DARAM 


Source SARAM | 2 | | 9 2 2+Dcode 
Destination DARAM : 


Source SARAM TD 84dyee 3+ dust 3+dust 5+dusttPcode 
Destination Ext | 

Source Ext 3+Psro+ ast 3+Dsro+dgst 3+Psro+dast 5+Psrctdastt+Pcode 
Destination Ext 


~ Source DARAM/ROM ——si|son# n+1 n+1 | n+1+Pcode 
Destination DARAM 


Source SARAM } n+ | n+1 n+1 N+14+Deode 
Destination DARAM | | 
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Cycle Timings for a Repeat (RPT) Execution (Continued) 


PPR | POAT PSE 
Source Ext | | n+14NDere N+1+NPero N+1+NDereo N+1+NPsret+Pcode 
Destination DARAM : 


| Source DARAM/ROM n+ n+1 n+1 N+1+Dcode 
Destination SARAM n+3t 


Source SARAM n+1 | n+ N+1+Dcode 
Destination SARAM 2n—-1+Pcode* 
| Source Ext | N+14NPsro N+14+NDsrc N+14NPgro N+1+NPsrc+Pcode 
Source DARAM/ROM 2n+1+ndgo¢ | 2n+1+Ndgey 2n+1+Nndgsz 2n+1+NdgsitPcode | 
ee aa ae le 
Source SARAM 2n+1+Ndgsz | 2n+1+Ndgst 2n+14nNdgsz 2nN+1+Ndyst+Pcode 
omiminet ff 
Source Ext | 4n—14MPgrct | 40-14NPeet | 4n—14NPect | 4n+1+NPsro+Ndastt+Pcode 
[oetneentt | Mat | Man | tan | 


T If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 
§ If both operands and the code are in the same SARAM block. 


Direct BMAR/DMA: [abe] BLPD BMAR, dma 
Indirect BMAR/DMA: _ [/labef BLPD BMAR, {ina} [,next ARP] 


Cycle Timings for a Single Instruction 


PPE | | 


3+2Pcode 5 


. Source DARAM/ROM | 
| Destination SARAM 


Source SARAM 
Destination SARAM | 


Source Ext © 


Source DARAM/ROM 
Destination Ext 
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BLPD Block Move From Program Memory to Data Memor 
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| Cycie Timings for a Single Instruction (Continued) | | | 


Source SARAM 4+dyst | 44+dgst 4+dyst 6+dyst+2Pcode 
Destination Ext 


Source Ext 4+Dsrot ast 4+DPsro+dast | 4+Dsro+dgst 6+Dsrct dast+2Pcode 
Destination Ext 


Cycle Timings for a Repeat (RPT) Execution 


Source DARAM/ROM n+2 n+2 n+2 N+2+2pcode 

Destination DARAM | 
Source SARAM n+2 n+2 n+2 N+2+2pcode 

Destination DARAM 

Source Ext N+2+NDPereo N+2+NDsro N+2+NDoreo N+2+NPsrot+2Pcode 
Destination DARAM 

Source DARAM/ROM n+2 n+2 | N+2+2Pcode 

Destination SARAM n+4t 


Source SARAM n+2 N+2+2Pcode 
| 2ni 
N+2+NPsrco N+2+NPero N+2+NPsrot+2Pcode 
N+4+NDoot 
Source DARAM/ROM a | 


Source Ext 
Destination SARAM 


n+2 

n+2 
Destination SARAM ant 2N+2Pcodet 
N+2+NPerot 
2N+2+Ndycz 2n+2+Nndgcrz 2n+2+ndgst 2N+2+Ndgsit2Pcode 
Destination Ext 
Source SARAM 2n+2+ndgst — 2n+2+ndyst 2n+2+ndysz 2N+2+Ndystt+2Pcode 
Destination Ext 
Source Ext 4n+NDPsrot 4n+npgrcot 4n+nPsrot 4N+24NPorotNdgst+ 
Destination Ext Ndastt Ndgst Ndgst 2Pcode 


t If the destination operand and the code are in the same SARAM block. 
+ if both the source and the destination operands are in the same SARAM block. 
§ If both operands and the code are in the same SARAM block. 


Example 1 BLPD #800h,00h ; (DP=6) 
Before Instruction After Instruction 
Program Memory Program Memory 
800h 800h 
Data Memory Data Memory 
300h 300h 
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Example 2 BLPD #800h,*,AR7 
Before Instruction 
ARP aes) 
ARO 310h 
Program Memory 
800h 1111h 
Data Memory 
310h 0100h 
Example 3 BLPD BMAR,0OOh ;(DP=6) 
Before Instruction 
BMAR 800h 
Program Memory 
800h 
Data Memory 
300h 
Example 4 BLPD BMAR, *+,AR7 
Before Instruction 
ARP tC 
ARO 300h 
BMAR 810h 
Program Memory 
810h 
Data Memory 
300h 0100h 


ARP 
ARO 


Program Memory 
800h 


Data Memory 
310h 


BMAR 


Program Memory 
800h 


Data Memory 
300h 


ARP 
ARO 
BMAR 


Program Memory 
810h 


Data Memory 
300h 


After Instruction 


310 


1111 


1111 


After Instruction 
800 


ii 
Tl 
> => 


After Instruction 


301 
810 


4444 


4444 
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oh. 


BSAR Barrel Shift 


Syntax [label] BSAR_ shift 
Operands 1 < shift s 16 
Opcode 


15 14 #13 #12 #11 :#10 9 8 7 6 5 4 3 2 1 #0 


t See Section 4.5. 


Execution (PC) +1 — PC 
(ACC) / 2shift —- ACC 


Affected by SXM. 


Description The BSAR instruction executes a 1- to 16-bit right-barrel arithmetic shift of the 
accumulator in a single cycle. The sign extension is determined by the sign-ex- 
tension mode bit in status register 1 (ST1). 


Words 1 
Cycles [label BSAR_ shift 


_ Cycle Timings for a Single Instruction 


ae kee Ee 


Cycle Timings for a Repeat (RPT) Execution 


Re 2a Ea 


Example 1 BSAR 16 ~—=§ ;(SXM=0) 
Before Instruction After Instruction 
ACC ACC 

Example 2 BSAR 4 ; (SXM=1) 
Before Instruction After Instruction 
ACC ACC 


4-72 Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


EOSIN RAO LOE 


Call Subroutine at Location Specified by Accumulator CALA 


SERCO COROOCIIA A TSI IA  I AT II E E 


[label CALA[D] 


None 


CALA 


156 14 #13 #12 #11':#10 9 8 7 6 5 4 3 2 1 Q 
1 oO 1 1 1 1 1 0 O O 1 1 O O Od QO 


CALLD 


1514 #13 :#12 ~#«11:+#10 9 8 7 6 5 4 3 2 ji QO 
1 O 1 7141 1 ~=7 ~+«4 «40 0 0 +1 1 1 1 O 1 


Nondelayed: PC+1—- TOS 
Delayed: PC +3 — TOS 
ACC(15—-0) — PC 


The current program counter (PC) is incremented and pushed onto the top of 
the stack (TOS). Then, the contents of the lower half of the accumulator are 
loaded into the PC. Execution continues at this address. Ifthe call is a delayed 
call (specified by the D suffix), the one two-word instruction or two one-word 
instructions following the call instruction are fetched from program memory 
and executed before the call is executed. 


The CALA instruction is used to perform computed subroutine calls. 


1 
[label] CALA 


—— Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


T The 'C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken these two instruction words are discarded. 


[label CALAD 


| Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


4-73 


CALA 


PC 
ACC 
TOS 


CALAD 
MAR *+,AR1 
LDP #5 


ARP 
ARO 
DP 
PC 
ACC 
TOS 


Before Instruction 


il 


100 


Before Instruction 


IM 


100h 


PC 
ACC 
TOS 


ARP 
ARO 
DP 
PC 
ACC 
TOS 


After Instruction 


After Instruction 


After the current AR, ARP, and DP are modified as specified, the address of 
the instruction following the LDP instruction is pushed onto the stack, and pro- 


gram execution continues from location 83h. 


Assembly Language Instructions 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Cycles 


[label] CALL[D] pma [,{ina} [,next ARP] 


0 =< pma = 65535 
Os next ARP s 7 


N 


1 1 See Subsection 4.1.2 


15 14 13 12 11 10 9 8 6 5 4 3 2 1 QO 
0 A. 
16-Bit Constant | 


15 14 13 #12 #11 °:10 9 8 7 6 5 4 3 2 | 0 


Oo 1 141 4 4 «4 «4 ~«0 See Subsection 4.1.2 
16-Bit Constant 


Nondelayed: PC +2 — TOS 
Delayed: PC +4 -— TOS 

pma — PC 

Modify AR(ARP) and ARP as specified. 


The current program counter (PC) is incremented and pushed onto the top of 
the stack (TOS). Then, the contents of the program memory address (pma), 
either a symbolic or numeric address, are loaded into the PC. Execution con- 
tinues at this address. The current auxiliary register and ARP are modified as 
specified. If the call is a delayed call (specified by the “D” suffix), the one 
two-word instruction or two one-word instructions following the call instruction 
are fetched from program memory and executed before the call is executed. 
2 


[label CALL pma[,{ina} [,next ARP] 


Cycle Timings for a Single Instruction 
CE ae (NE [NT 
Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


tT The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


[abe] CALLD pma [,{ina} [,next ARP] 


Cycle Timings for a Single Instruction 
p22 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


4-75 


Example 1 CALL PRG191,*+,AR0 


ARP 

AR1 
PC 

TOS 


Before Instruction 


Cs 
Ani 
PC 
TOs 


After Instruction 


| 


~ OBFh 


| 


OBFh is loaded into the program counter, and the program continues executing 


from that location. 


Example 2 CALLD PRG191 


MAR *+,AR1 
LDP = #5 


ARP 
ARO 
DP 
PC 
TOS 


Before Instruction 


aaa) ARP 
ARO 
as DP 
PC 


TOS 


After Instruction 


| 


| 


| OBFh 


After the current AR, ARP, and DP are modified as specified, the address of 
the instruction following the LDP instruction is pushed onto the stack, and pro- 
gram execution continues from location OBFh. 
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Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


[label CC[D| pma [cond] [,condZ [....] 
0 < pma s 65535 


Conditions: ACC=0 EQ 
ACC#0 NEQ 
ACC<0 LT 
ACCs0 LEQ 
ACC>0 GT 
ACC20 GEQ 
C=0 NC 
C=1 C 
OV=0 NOV 
OV=1 OV 
TC=0 NTC 
TC=1 TC 
BIO low BIO 
Unconditionally UNC 

CC 


16-Bit Constant 


t See Section 4.5. 


lf(condition(s)) 
Then 
Nondelayed: PC +2 — TOS 
Delayed: PC +4 — TOS 
pma — PC 
Else 
PC +2 — PC 


Control is passed to the program memory address pma if the specified condi- 
tions are met. Note that not all combinations of conditions are meaningful. In 
addition, the NTC, TC, and BIO conditions are mutually exclusive. If the call 
is a delayed call (specified by the “D” suffix), the two one-word instructions or 
the one two-word instruction following the call are fetched from program 
memory and executed before the call is executed. The CC instruction operates 
like the CALL instruction if all conditions are true. 


2 
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Cycles 


Example 1 


Example 2 
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[label] CC pma[cond7] [,condZ |,...] 


Cycle Timings for a Single Instruction | 
PR a ee OS (a 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


t The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken these two instruction words are discarded. 


[labe] CCD pma [cond] [,condZ {,...] 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


cc PGM191,LEQ,C 


lf the accumulator contents are less than or equal to zero and the carry bit is 
set, OBFh is loaded into the program counter, and the program continues ex- 
ecuting from that location. If the conditions are not met, execution continues 
at the instruction following the CC instruction. 


CCD PGM191,LEQ,C 
MAR *+,AR1 
LDP #5 


The current AR, ARP, and DP are modified as specified. If the accumulator 
contents are less than or equal to zero and the carry bit is set, the address of 
the instruction following the LDP instruction is pushed onto the stack and pro- 
gram execution continues from location OBFh. If the conditions are not met, 
execution continues at the instruction following the LDP instruction. 


Assembly Language Instructions 


_ Clear Control Bit CLRC 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


[fabef| CLRC control bit 
Control bit. STO, ST1 bit (from: {C, CNF, HM, INTM, OVM, TC, SXM, XF}) 


Reset overflow mode (OVM) 
_15 14 13 #12 11: #10 9 8 7 6 5 4 3 2 1 0 
1 oO 1 14 14 14 14 O O0O 4 0 0 0 0 1 =O 


Reset sign extension mode (SXM) 
15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 Q 
1 O 14 4 4 4 +4 0 0 1 0 0 0 4 +4 = 0 


Reset hold mode (HM) 
15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 90 
1 O 14 #1 1 1 1 0 0e 1 0 0 14 0 0 0 


Reset TC bit 
15 14 13 12 11109 8 %7F 6 5 4 3 2 1 0 
1 0 1 1 1 1 1 0 O 1 0 0 1 0 1 0 


Reset carry (C) 
15 #14 13 #12 #11 #10 9 8 7 6 5 4 3 2 $71 #9 


Reset CNE bit 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O-| 


Reset INTM bit 
15 14 13 12 11 109 8 7 6 5 4 3 2 1 0 


Reset XF pin 
15 14 13 12 li 10 9 8 7 6 5 4 3 2 1 9 


(PC) +1 —- PC 
0 — controi bit 


The specified control bit is set to a logic zero. Note that the LST instruction may 
also be used to load STO and ST1. See subsection 3.6.3, Status and Control 
Registers, for more information on each of these control bits. 


1 
amass CLRC control bit 


Cae 6, ie 1 nn 
a 


| Cycle Timings for a noes ee Execution 
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CLRC Clear Control Bit 


Example CLRC TC 7TC is bit 11 of ST1l 
Before instruction After instruction 


sT1 sT1 [ xtronh] 


4-80 Assembly Language Instructions 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 


Cycles 


Example 


Complement Accumulator CMPL 


(label CMPL 


None 


15 14 #13 #12 ~#«11':#10 9 8 7 6 5 4 3 2 1 OQ 
1 O 1 1 #1 +1 ~=«71 «0 0 0 0 0 0 0 0 1 


(PC) +1 — PC 
(ACC) — ACC 


The contents of the accumulator are replaced with its logical inversion (ones 
complement). The carry bit is unaffected. 


_— 


[label] CMPL 


Cycle Timings for a Single Instruction 
ae ee re i eee 
Cycle Timings for a Repeat (RPT) Execution 


Before Instruction After Instruction 


ACC 0F 7982513 ACC 0867DAECh 
Cc Cc 
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Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-82 


[label] CMPR constant 
0<CMs3 


t See Section 4.5. 


(PC) +1 — PC 
Compare AR(ARP) to ARCR, placing result in TC bit of status register ST1. 


Affects TC; affected by NDX. 
Not affected by SXM; does not affect SXM. 


The CMPR instruction performs a comparison specified by the value of CM: 


If CM = 00, test if AR(ARP) = ARCR 
If CM = 01, test if AR(ARP) < ARCR 
If CM = 10, test if AR(ARP) > ARCR 
If CM = 11, test if AR(ARP) « ARCR 


If the condition is true, the TC bit is set to 1. If the condition is false, the TC bit 
is set to 0. 


Software compatibility with "C25 can be maintained by resetting the NDX bit 
inthe PMST register to 0. This causes any ’C25 instruction that loads auxiliary 
register 0 (ARO) to load the ARCR register also. This allows source-code com- 
patibility with the 'C25. Note that the auxiliary registers are treated as unsigned 
integers in the comparisons. 


1 
[label] CMPR constant 


1+ 
Cycle Timings for a Repeat (RPT) Execution 
i 


CMPR 2 
Before Instruction After Instruction 
ARP tC ARP Gee) 
ARCR ARCR 
AR4 [SFA AR4 
TC ae) TC 


Assembly Language Instructions 


immediate With Data Value CPL 


SLOG OOOO NOOR DENIER MSI 


Compare DBMR or Long 


Syntax Direct: [label CPL [,#/k] dma 

Indirect: [label CPL [,#/kj {ind} [,next ARP 
Operands 0<dmas 127 

Ik: 16-bit constant 

0 < next ARP s 7 
Opcode Compare DBMR to data value 


15 14 13 12 11 10 9 8 7 6 5 4 38 2 14 +O 
Direct:[0 1 01 1 0 1 1] 0]  DalaMemor, Address 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 1 #0 


Indirect} O 1 O 41 1 0 1 4]1— See Subsection 4.1.2 


Compare data with long immediate 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 1 9 


1 0 1 1 1 1 110] Data Memory Address 


Direct: 16-Bit Constant 


ee eS oa eae 4.1 : 
16-Bit Constant 


Execution lk unspecified: 
(PC) +1 — PC 
Compare DBMR contents to (dma). 
lf (DBMR) = (dma), 
TC =1; 
Else, 
TC =0. 


Indirect: 


lk specified: 
(PC) +2 — PC 


Compare Ik to (dma). 
If Ik = (dma), 

TC =1; 
Else 

TC =0. 


Affects TC. 
Not affected by SXM. 


Description If the two quantities involved in the comparison are equal, the TC bit is set to 
one. TC is set to zero otherwise. 


Words 1 (If long immediate value is not specified) 


Words 2 (If long immediate value is specified) 
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Cycles 


Example 1 


4-84 


pare DBMR or Long Immediate With Data Value 


Sabattus ts aah na Anas ata aaa a iene ati a as aaa atta ata tlt a aaa aaa at a alltel aan a nai nis anid i ae aati tata atti lai lt aly aid aeslasadntatatal tal Sat tenia chalat al oe aia alae eaten eins talala tain ate ea tatiana Coats 


Direct: [label] CPL dma 
Indirect: [label CPL {ina} [,next ARP 


aie Tings Tora iain 
I 
i 


Cycle Timings for a Repeat (RPT) Execution 


esa 
EL a A 


Operand SARAM 


Operand Ex 


t If the operand and the code are in the same SARAM block. 


Direct: [label CPL #/k dma 
Indirect: [label CPL #/k {ina} [,next ARP 


Cycle Timings for a Single Instruction 
Operand DARAM 


pa__[ Poa | Pa 

ae Ce 
ee oe ~ 

a ee 


0 
[Operand ARAM [met 


Operand SARAM n+1 n+1 n+1- 


—[tpend | 


Operand Ext 


t If the operand and the code are in the same SARAM block. 


CPL #060h,60h 


Before Instruction After instruction 
Data Memory Data Memory 


60h 60h 
TC TC ae) 


Assembly Language Instructions 


Immediate With Data Value CPL 


Example 2 CPL 60h 


Data Memory 
60h 


DBMR 
TC 


Example 3 CPL 


ARP 
AR7 


Data Memory 
300h 


TC 


Example 4 CPL 


AR6 


Data Memory 
300h 


DBMR 
TC 


#0F1h,*,AR6 


Data Memory 
60h 


DBMR 
TC 


ARP 
AR7 


Data Memory 
300h 


TC 


ARP 
AR6 


Data Memory 
300h 


DBMR 
TC 


After Instruction 


After Instruction 


300 


OF 1 


After instruction 


300 


OF1 
| OFO 


| 


4-8 


or 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 


Cycles 


Example 1 


Example 2 
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[label] CRGT 


None 


15 14 #13 #12 +1: #10 9 8 7 6 5 4 3 2 1 JO 
1 O 14 71 = 1 1 1 0 0 0 0 14 14 O 1 1 


(PC) +1 — PC 
If (ACC) > (ACCB) 

Then (ACC) — ACCB;1— C 
If (ACC) < (ACCB) 

Then (ACCB) — ACC;0— C 
If (ACC) = (ACCB) 

Then 1 > C 


Affects C. 


The contents of the accumulator (ACC) are compared to the contents of the 
accumulator buffer (ACCB). The larger value (signed) is loaded into both regis- 
ters. If the contents of the accumulator are greater than or equal to the contents 
of the accumulator buffer, the carry bit is set to 1. Otherwise, it is set to 0. 


1 
[label CRGT 


Cycle Timings for a Single Instruction 


CRGT 
Before Instruction After Instruction 
ACCB ACCB [hI 
ACC [s«CS hI ACC 
C Seer) C See) 
CRGT 
Before Instruction After Instruction 
ACCB ACCB 
ACC ACC 
c aa c aes | 


Assembly Language Instructions 


lest for ACC Smaller ThanACCB_ CRLT 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 


Cycles 


Example 1 


Example 2 


[label] CRLT 


None 


15 14 + #13 «12 ~«11~=«410~S OD 8B COUT LCUCG lc UL GK CUD LU? LULdiClD 
1 O 1 #4 = 1 1 1 0 0 0 0 1 1 O 1 1 


(PC) +1 —- PC 
If (ACC) < (ACCB) _ 

Then (ACC) — ACCB;1— C 
If (ACC) > (ACCB) 

Then (ACCB) — ACC;0— C 
If (ACC) = (ACCB) 

ThendO — C 


Affects C. 


The contents of the accumulator (ACC) are compared to the contents of the 
accumulator buffer (ACCB). The smaller (signed) value is loaded into both reg- 
isters. If the contents of the accumulator are less than the contents of the accu- 
mulator buffer, the carry bit is set to 1. Otherwise it is set to 0. 


1 
[label] CRLT 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
a a ee ee 


CRLT 
Before Instruction After Instruction 
ACCB [sh] ACB 
ACC ACC 
c fe) c Ca) 
CRLT 
Before Instruction After instruction 
ACCB ACCE 
ACC ACC 
c ass c ieee) 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Direct: [label] DMOV dma 
Indirect: [label DMOV {ing} [,next ARF 


0<dmas 127 
0 <s next ARP s7 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 QO 
Direct} O 1 #1 #14 #O 1 1 #10) Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #O 


indirect[_O 11101 1 1] 1] See Subsection 412 


(PC) +1 —- PC 
(dma) — dma+1 


Affected by CNF and OVLY. 


The contents of the specified data memory address are copied into the con- 
tents of the next higher address. DMOV works only within on-chip data RAM 
blocks. It works within any configurable RAM block if that block is configured 
as data memory. In addition, the data move function is continuous across block 
boundaries. The data move function cannot be used on external data memory 
or memory-mapped registers. If used on external memory or memory-mapped 
registers, DMOV will read the specified memory location but will perform no 
operations. 


When data is copied from the addressed location to the next higher location, 
the contents of the addressed location remain unaltered. 


The data move function is useful in implementing the z~' delay encountered 
in digital signal-processing. The DMOV function is included in the LTD, MACD, 
and MADD instructions (see the LTD, MACD, and MADD instructions for more 
information). 


1 


Direct: [label DMOV dma 
Indirect: [label] DMOV {ina} [,next ARP 


Cycle Timings for a Single Instruction 
OperandDaRAM [Tt Tt 


Operand SARAM 1 1 1 


Assembly Language Instructions 


Wont atn Sno away win anna nna nwa at ate at aay any nn aa ahaa at xa 


Example 1 


Example 2 


Data Move in Data Memory DMOV 


Cycle Timings for a Repeat (RPT) Execution 
oni inal 


Operand SARAM 2n-2 2n-2 2n—-2+p 
2n+it 
Operand Ext 4n-2+2nd | 4n-2+2nd | 4n-2+2nd 4n+1+2nd+p 


t If the operand and the code are in the same SARAM block. 


DMOV DATS ;(DP = 6) 


Before Instruction After Instruction 


Data Memory Data Memory 

308h 308h ee) 
Data Memory Data Memory 

309h 309h 43h 


DMOV *, ARI 


Before Instruction After Instruction 


a | 


AR1 30Ah AR1 30Ah 


Data Memory Data Memory 
30Ah 40h 30Ah 
Data Memory Data Memory 
30Bh 30Bh 40h 
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co 


EXAR Exchange ACCB With Accumulator 


LO renee an ET NT TILL LS PILE LAD NELLA IRD TOL TDP I ALDARA PA PALS ITIL DTS TOTES 


Syntax [label EXAR 
Operands None 
Opcode 


15 14 #13 #12 #11':+#10 9 8 7 6 5 
1 O 141 14 4 4+ 1 OO O80 0 O 4 4 1T =O 1 


Execution (PC) +1 — PC 
(ACCB) < (ACC) 
Description The contents of the accumulator is exchanged (switched) with the contents of 
the accumulator buffer (ACCB). 
Words 1 
Cycles [labef| EXAR 
Cycle Timings for a Single Instruction 
Example EXAR 
Before Instruction After Instruction 
ACC ACC 
ACCB ACCB 


4-90 Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 


Cycles 


Example 


[label] IDLE 


None 


15. (14 #13 :#12 «11 :°#10 9 8 7 6 5 4 3 2 1 #Q 
1 O 1 1 #1 ~=1 ~=+1 +0 0 0 1 0 0 0 1 ~=«20 


(PC) +1 — PC 
Affected by INTM. 


The IDLE instruction forces the program being executed to wait until an un- 
masked interrupt (external or internal) or reset occurs. The PC is incremented 
only once, and the device remains in an idle state until interrupted. 


The idle state is exited by an unmasked interrupt even if INTM is 1. If INTM is 
1, the program scontinue executing at the instruction following the IDLE. If 
INTM is 0, the program branches to the corresponding interrupt service rou- 
tine. Execution of the IDLE instruction causes the ’C5x to enter the power- 
down mode. During the idle mode, the timer and serial port peripherals are still 
active. Therefore, timer and peripheral interrupts, as well as reset or external 
interrupts, will remove the processor from the idle mode. 


1 
[label IDLE 


Cycle Timings for a Single Instruction 


Not Repeatable 


IDLE ;The processor idles until a reset or unmasked 
;interrupt occurs. 
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IDLE2 = /dle Until Interrupt — Low-Power Mode 


Syntax [label IDLE2 
Operands None 
Opcode 


15 14 #13 #12 #11 :°#10 9 8 7 6 5 4 3 2 1 #0 
1 O 141 14 74 4 4 0 0 0 7 0 0 0 14 1 


Execution (PC) +1 — PC 
Affected by INTM. 
Description The IDLE2 instruction removes the functional clock input from the internal de- 


vice. This allows for an extremely low power mode. The PC is incremented 
only once, and the device remains in an idle state until interrupted by reset or 
an unmasked interrupt. 


The low power mode is exited by an unmasked interrupt even if INTM is high. 
If INTM is high, the program continues executing at the instruction following 
the IDLE2. If INTM is low, then the program branches to the corresponding in- 
terrupt service routine. Execution of the IDLE2 instruction causes the ’'C5x to 
enter the power-down mode. Unlike the idle mode, in the idle2 mode the pe- 
ripherals (serial ports or timer) are not active. 


The idle2 mode is exited by a low logic level on an external interrupt 
(INT{—-INT4), RS, or NMI with a duration of at least five machine cycles since 
interrupts are not latched as in normal device operation. 


Words 1 
Cycles [label IDLE2 


Cycle Timings for a Single Instruction 


Not Repeatable 


Example IDLE2 ;The processor idles until a reset or unmasked external 
;interrupt occurs. 


4-92 Assembly Language Instructions 


Input Data From Port IN 


Syntax Direct: [label] IN dma, PA 
Indirect: [label] IN {ina} ,PA[,next ARP 
Operands 0<dmas 127 
0 s next ARP s7 
O0=< PA =< 65535 
Opcode 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 ji Q 
nirect [10 101 1 1 1][0]  DataMemory Address 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 Q 
ndivect| 10 1 0 1 1 1 1]1]  SeeSubsectionat2 
16-Bit Constant 
Execution (PC) +2 — PC 
While (repeat counter) = 0 
Port address —> address bus A15—A0 
Data bus D15—DO — dma 
Port address — dma 
Port address + 1 —» Port address 
(repeat counter — 1) — repeat counter 
Description The IN instruction reads a 16-bit value from an external I/O port into the speci- 
fied data memory location. The IS line goes low to indicate an I/O access, and 
the STRB, RD, and READY timings are the same as for an external data 
memory read. Note that port addresses 50h-5Fh are memory-mapped (see 
subsection 5.1.1), but the other port addresses are not. 
RPT can be used with the IN instruction to read in consecutive words from I/O 
space to data space. In the repeat mode, the port address (PA) is incremented 
after each access. 
Words 2 
Cycles Direct: [label] IN dma, PA 


Indirect: [label IN {ina} ,PA [,next ARP] 


Cycle Timings for a Single Instruction 


Destination SARAM 241057 2+10src¢ 2+1Ogr¢ 3+10sro+2P code 
3+i0g7¢1 


Destination Ext 3+dysttiOgrc 3+dysttiOsrc 6+dyst+iOsre+2Pcode 
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Port 


ete estate aie oe ee Ne eT ae ae NE, wee Oe TTT RENT REPO RE COLELLO EEN DETER TL OOO COL AOR OD ENE SOE CD ELLE L LENCE LCC EEE RL EEN AL ETE DRAPE EE OCT DAL EOP AL AD EE EE EEE AS EC TADS ECA TOLD LOO, TEAL D OD FE ALE eA ETT ISI OPES LE SELLE OPED TLIO OL OE DEEL AOI ETO OPO OLED LA OE DEI! 


IN Input Data From 


fC | Cycle Timings for a Repeat (RPT) Execution | | 


Destination DARAM | 2n+nidg-c 2n+1+Ni0sre+2Pcode 


~ Destination SARAM | 2n+nidgrc | 2n+tnidgre 2ntNidsre 2nt1+Nidsc+2Pcode | 
| 2n+2+nidg;ot 


Destination Ext 4n—1+ndgs;+ | 4n—1+Ndyge¢t+ 4n—1+Nndgst+ 4n+2+ndystNnidgrot 
MNOsre NOsre NOsre 2Pcode 


t If the destination operand and the code are in the same SARAM block. 


Example 1 IN DAT7,PA5 ;Read in word from peripheral on port 
saddress 5. Store in data memory location 
:307h (DP=6). 


Example 2 IN *, PAO ;Read in word from peripheral on port 


saddress 0. Store in data memory location 
sspecified by current auxiliary register. 


4-94 Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


[label] INTR k 


Osks31 


15 14 #13 #12 #11 :#10 9 8 7 6 5 4 3 2 1 9QO 


1 O 4 #74 #74 74 4+ =O O 1 1 INTR# t 


T See Section 4.5. 


(PC) +1 — stack 
corresponding interrupt vector — PC 


Not affected by INTM. 
Affects INTM. 


The INTR instruction is a software interrupt that transfers program control to 
the program memory address specified by k (see the following table). The in- 
struction allows any interrupt service routine to be executed from your soft- 
ware. During execution of the instruction, the contents of PC + 1 is pushed onto 
the stack. Note that the interrupt mask has no effect on the INTR instruction. 
An INTR interrupt for the external interrupts (INT1—INT4) looks exactly like an 
external interrupt (an interrupt acknowledge is generated, the appropriate bit 
in the IFR is cleared, interrupts are globally disabled (INTM = 1), and context 
is automatically saved). See subsection 5.1.2 for a complete description of in- 
terrupt operation. 


CT terupt—]Tocation J] tterrapt | tosaton — 


Reserved 


OO 
OO 
a 
OO EC 
a A 
efron [ttn | 208 
a 
[ie Reseed [ran] [seiner | aan 
[Resend [08 |_| Userdtnes [ 68 
(72 | Fesenes [1028 [_Userdotines | 20m 
i Fesewes[ tan |_| Userdotines [aan 
[i [Fesewes[ sch] 20 [serine [20h 
A 
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INTR = Soft Interrupt 


Words 1 
Cycles [label INTR k 


Not Repeatable 


t The ’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


Example INTR 3 ;Control is passed to program memory location 6h 
;PC + 1 is pushed onto the stack. 
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Load Accumulator With ACCB LACB 


Syntax 


Operands 


Opcode 


Execution 


Description 
Words 
Cycles 


Example 


[labe] LACB 


None 


15 14 #13 #12 #11:°:10 9 8 7 6 5 4 3 2 i QO 


pare 
o>) 
= 
a 
ear 
_ 
—_ 
2) 
Lo) 
oO 
o>) 
—_ 
— 
oak 
= 
=k 


(PC) +1 — PC 
(ACCB) — ACC 


The accumulator is loaded with the contents of the accumulator buffer (ACCB). 


omc, 


[abe] LACB 


Cycle Timings for a Single Instruction 
A RO 
| Cycle Timings for a Repeat (RPT) Execution 


LACB 
Before Instruction After Instruction 
ACC ACC [__ SSS5AAAAN | 
ACCB 5555AAAAh | ACCB 
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LACC Load Accumulato 
OS ih aS RN SR ER RO 


SOEs Sess sees SS SSS 


Syntax Direct: [label] LACC dma [,shift1] 
Indirect: [labell LACC {ind} [,shift1 [,next ARP] 
Immediate: [label LACC #/k [,shiftZ 
Operands 0 <dmas 127 
O s next ARP s 7 
O<shifti<16 (defaults to 0) 
—32768 s Ik < 32767 
Osshift2<15 (defaults to 0) 
Opcode 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 = =O 
Direct: 00 0 1] SHFT! | 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =O 
Indirect 00 0 1]  SHFT’  |1]|  SeeSubseclon4i2 
15 14 13 12 11 10 9 8 7 6 5 4 3 +2 +#$1 #20 
1 014 14 4 4 4 +4 +4 0 0 0]  SHETt | 
Long: 
Load ACC with shift of 16 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =O 
Direct: [0 7 1 0 1 0 1 O] 0]  DataMemoryAddress | 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 #1 = #@Q 
Indiect|[ O01 1 0 7 0 1 O|t|  SeeSubsedlon4i2 
t See Section 4.5. 
Execution Direct or Indirect Addressing: 
(PC)+1 — PC 
(dma) x 2shiftt _. ACC 
Long Immediate Addressing: 
(PC) +2 — PC 
Ik x gshift2 _, ACC 
Affected by SXM. 
Description The contents of the specified data memory address or a 16-bit constant are 
left-shifted and loaded into the accumulator. During shifting, low-order bits are 
zero-filled. High-order bits are sign-extended if SXM = 1 and zeroed if SXM = 0. 
Words 1 (Direct or indirect addressing) 
2 (Long immediate addressing) 
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Cycles 


Example 1 


Example 2 


Direct: [labe LACC dma [,shift1] 
Indirect: [label LACC {ina} [,shift? [,next ARP] 


Operand SARAM 


1 
Operand Ext 1+d 
Cycle Timings for a Repeat (RPT) Execution 
Operand DARAM 


Operand SARAM 


| Operand Ext n+nd 


t If the operand and the code are in the same SARAM block. 
Immediate: [label LACC #/k [,shiftZ 


Not Repeatable 


LACC DAT6,4 ;(DP = 8. SXM = 0) 
Before Instruction After Instruction 
Data Memory Data Memory 


406h 406h 
acc TESTE 
C C 


LACC *,4 ;(SXM = 0) 


Before Instruction After Instruction 


ARP ieanes: ARP aia: 
AR2 AR2 


Data Memory Data Memory | 
300h OFFh 300h | OFFh 
ACC 012345678 ACC = OFFOh 
Cc Cc 
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LACC Load Accumulator With Shift 


Example 3 LACC #F000h,1 ;(SXM = 1) 
Before Instruction After instruction 
ACC 012345678h ACC OFFFFEOOOh 
C C 
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Syntax Direct: [label] LACL dma 


Indirect: [label LACL {ina} [,next ARF 
Immediate: [label LACL #k 
Operands 0 <dmas 127 
0 < next ARP <7 
0<ks 255 
Opcode 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 +1 ~=90 
Direc![O 1 +1 0 1 0 0 14]0]  DataMemoryAddress 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 #1 0 
Indirect:[ 07 1 0 7 0 0 1] 1]  SeeSubsection4i2 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 #1 0 
ShortImmediate:["1__O 1 +1 +1 +O O 1]  BitConstant | 
Execution (PC) +1 —- PC 
Direct or Indirect Addressing: 
0 — ACC(31-16) 
(dma) — ACC(15-0) 
Short Immediate Addressing: 
0 — ACC(31-8) 
k — ACC(7-0) 
Not affected by SXM. 
Description The contents of the addressed data memory location or a zero-extended 8-bit 


constant are loaded into the 16 low-order bits of the accumulator. The upper 
half of the accumulator is zeroed. The data is treated as an unsigned 16-bit 
number rather than a 2s-complement number. There is no sign-extension of 
the operand with this instruction, regardless of the state of SXM. 


Words 1 
Cycles Direct: [label LACL dma 
Indirect: [label LACL {ina} [,next ARP 


Cycle Timings for a Single Instruction 


Operand DARAM 


Operand SARAM 


Example 1 


Example 2 


Example 3 
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Operand DARAM _ 


| Operand SARAM * 


| Cycie Timings for a Repeat (RPT) Execution 
POR [reas ei 


n+p 


n n+p _ 
| n+1t 


Operand et rand 


t If the operand and the code are in the same SARAM block. 


Immediate: [label] LACL #k 


es 
ce ia 5 ee 


Cycle Timings for a Repeat (RPT) Execution _ 


Not Repeatable 


LACL DAT1 ;(DP = 6) 
Before Instruction 
Data Memory 


301h 
ACC 7FFFFFFFh 


Before Instruction 


ARP Eas 
ARO 


Data Memory 
401h 
ACC 
C 
LACL #10h 
Before Instruction 
ACC X 7FFFFFFFh 


After Instruction 
Data Memory 


301h 
ACC 
C 


After instruction 


me Cd 


ARO 
Data Memory 
40th OOFFh 


ACC OFFh 
Cc 


After Instruction 
ACC X 010h 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Direct: [label LACT dma 
Indirect: [label] LACT {ind} [,next ARF 


0<dma sx 127 
0 < next ARP <7 


15 14 13 12 11109 8 7 6 5 4 3 2 1 = @9O 
Direc:['0 1 1 0 1 0 1 17 Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


Indirect} 0 1 1 0 1 0 1 aes See Subsection 4.1.2 


(PC) +1 — PC 
(dma) x 2TREGI(3-0) -» ACC 
If SXM = 1: 
Then (dma) is sign-extended. 


If SXM = 0: 
Then (dma) is not sign-extended. 


Affected by SXM. 


The LACT instruction loads the accumulator with a data memory value that has 
been left-shifted. The left-shift is specified by the four LSBs of TREG1, result- 
ing in shift options from 0 to 15 bits. Using TREG1’s contents as a shift code 
provides a dynamic shift mechanism. During shifting, the high-order bits are 
sign-extended if SXM = 1 and zeroed if SXM = 0. 


LACT may be used to denormalize a floating-point number if the actual expo- 
nent is placed in the four LSBs of the T register and the mantissa is referenced 
by the data memory address. Note that this method of denormalization can be 
used only when the magnitude of the exponent is four bits or less. 


Software compatibility with the 'C25 can be maintained by setting the TRM bit 
of the PMST status register to zero. This causes any ’C25 instruction that loads 
TREGO to write to all three TREGs. Subsequent calls to LACT will contain the 
correct shift value in TREG1, maintaining object-code compatibility. 


1 


Direct: [label] LACT dma 
Indirect: [label LACT {ind} [,next ARP 


Cycle Timings for a Single Instruction 


Operand DARAM re il 


LPR | POA | 
Operand SARAM I> 4 


LACT Load Accumulator With Shift specified by TREG1 


Cycle Timings for a Repeat (RPT) Execution 


Operand DARAM : 


| Operand SARAM a 


Operand Ex 


t If the operand and the code are in the same SARAM block. 


Example 1 LACT DAT1 ;(DP = 6. SXM = 0) 
Before Instruction After Instruction 
Data Memory | Data Memory | 
301h 301h 
ACC ACC 

Cc C 
TREGI TREG! 
Example 2 LACT *-—,AR3 ;(SXM = 1) 

Before Instruction After Instruction 


a | | 
ant aR 


Data Memory Data Memory | | 
310h OFFOOh} 310h ____QFFOOh 
ACC 098F7EC83h ACC OFFFFFEOOh 
Cc C 


mest = (est | (Ca 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Direct: [label] LAMM dma 
Indirect: [label] LAMM {ing} [,next ARP 


0<dmas 127 
0 =< next ARP < 7 


15 14 13 12 
Direct:| O 0 0 O 


15 14 13 12 


1i_ 10 9Q 


7 


6 5 4 


3.2 1 #O 


Data Memory Address | 


6 5 4 


3. 2 1 £O 


Indirect:| 0 O 0 0 1 0 0 0 See Subsection 4.1.2 


(PC) +1 — PC 
(dma) — ACC 


Not affected by SXM. 


The lower half of the accumulator is loaded with the contents of the addressed 
memory-mapped register. The upper half of the accumulator is zeroed. The 
9 MSBs of the data memory address are set to zero, regardless of the current 
value of DP or the upper 9 bits of AR(ARP). This instruction allows any location 
on data page zero to be loaded into the accumulator without modifying the DP 
field in status register STO. 


1 


Direct: [label LAMM dma 
Indirect: [label LAMM {ina} [,next ARP 


Cycle Timings for a Single Instruction | 
ee 
Dpaard MUPOFT ieipi 


t Add one more cycle for peripheral memory mapped access. 
+ Add n more cycles for peripheral memory mapped access. 


LAMM BMAR ;(DP = 6) 
Before Instruction 
ACC 
BMAR 
Data Memory 
31Fh 


ACC 
BMAR 


Data Memory 
31Fh 


Example 2 LAMM * 


ARP 
AR1 
ACC 
PRD 


Data Memory 
325h 


Before Instruction 


22221376h 


After Instruction 


ARP feel 

ARI [3 25h] 

ACC 

PRD 
Data Memory 

325h 


Note that the value in data memory location 325h is not loaded into the accu- 
mulator. The value at data memory location 25h (address of the PRD register) 


is loaded. 
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Assembly Language Instructions 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Direct: 


indirect: 


Short: 


Long: 


Direct: [label LAR AR, dma 

Indirect: [label LAR AR, {ind} [,next ARP 
Short Immediate  [/abef| LAR AR, #k 

Long Immediate  [/abe LAR AR, #/k 


0<dmas 127 

0 < auxiliary register AR < 7 

0 < next ARP <7 

Osks 255 

0 < Ik s 65535 
11. 10 9 

ro 0000] ARXT TO Data Memory ‘dress 
1110 9 7 

i CC See Subsodtion 41.2 

14 10 9 7 
a a a a 


; 2 1 A | 
; ; ; : -e - 
16-Bit Constant 


t See Section 4.5. 


Direct or Indirect Addressing: 

(PC) +1 — PC 

(dma) — auxiliary register AR 

Short Immediate Addressing: 

(PC) +1 — PC 

k — auxiliary register AR 

Long Immediate Addressing: 

(PC) +2 — PC 

Ik — auxiliary register AR 

Affected by NDX. 

The contents of the specified data memory address or an 8-bit or 16-bit con- 
stant are loaded into the designated auxiliary register (AR). The specified con- 
stant is acted upon like an unsigned integer, regardless of the value of SXM. 


If the NDX bit of the PMST register is 0, then ARCR and INDX registers are 
also loaded to maintain compatibility with the 'C2x. 


The LAR and SAR (store auxiliary register) instructions can be used to load 
and store the auxiliary registers during subroutine calls and interrupts. If an 
auxiliary register is not being used for indirect addressing, LAR and SAR en- 
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able the register to be used as an additional storage register, especially for 
swapping values between data memory locations without affecting the con- 


tents of the accumulator. 

Words 1 (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) 

Cycles Direct: [labe] LAR AR, dma 
Indirect: [label LAR AR, {ina} [,next ARP] 


Cycle Timings for a Single Instruction 


| Source SARAM : 2n 2N+Pcode 
2n+it 


2n+-1+Ndere#Poode 


t If the source operand and the code are in the same SARAM block. 


Short Immediate _[/abel] LAR AR, #k 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


Long |mmediate _[/abel] LAR AR, #/k 


Cycle Timings for a Single Instruction 


Not Repeatable 
Example 1 LAR ARO,DAT16 ;(DP = 6) 
Before Instruction After Instruction 
Data Memory Data Memory 
310h 310h 


ARO ARO 
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Load Auxiliary Register LAR 


Example 2 LAR AR4, *— 
Before instruction After Instruction 
ARP eS C7 
Data Memory Data Memory 


300h Ca] 00 
AR ——— sae 


Note: 


LAR in the indirect addressing mode ignores any AR modifications if the AR 
specified by the instruction is the same as that pointed to by the ARP. There- 
fore, in Example 2, AR4 is not decremented after the LAR instruction. 


Example 3 LAR AR4,#01h 
Before Instruction After instruction 
AR4 OFFO9h AR4 | OTH 
Example 4 LAR AR4,#3FFFh 
Before Instruction After Instruction 


Ara ARs FFF 
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Syntax Direct: [label LDP dma 
Indirect: [label LDP {ind} [,next ARF 
Short Immediate: [labe] LDP #k 


Operands 0 <dmas 127 
0 = next ARP s 7 
O0sk<511 
Opcode 


9 
0 
15 14 13 12 11 #10 9 8 7 
0 0 O 1 0 1 


15 14 13 12 11 + #10 


8 7 6 5 4 83 2 1 ~=«0 


Execution (PC) +1 — PC 


Direct or Indirect Addressing: 
Nine LSBs of (dma) — data page pointer (DP) status bits 


Short Immediate Addressing: 
k — data page pointer register (DP) status bits 


Affects DP. 


Description The nine LSBs of the contents of the addressed data memory location or a 9-bit 
immediate value are loaded into the DP register. The DP and 7-bit data 
memory address are concatenated to form 16-bit data memory addresses. 
The DP can also be loaded by the LST instruction. 


Words 1 
Cycles Direct: [label LDP dma 
Indirect: (label LDP {ina} [,next ARP 


Cycle Timings for a Single Instruction | 


eee eee 2 


Source DARAM | 


Source SARAM ) 
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Cycle Timings for a Repeat (RPT) Execution 
Source DARAM ee 


Source SARAM 2n 2N+Pcode 
2n+it 


anendere | 2n+1+ndsrcPoode 


t If the source operand and the code are in the same SARAM block. 


[label] LDP #k 


Short Immediate: 


Cycie Timings for a Single instruction 


Not Repeatable 
Example 1 LDP DAT127 ;(DP = 511) 
Before instruction After Instruction 
Data Memory Data Memory 
OFFFFh OFFFFh 
DP DP 
Example 2 LDP #0h 
Before Instruction After instruction 
DP DP 
Example 3 LDP *, AR5 
Before Instruction After Instruction 
ARP Lo ARP eee 
ARS ARS 
Data Memory Data Memory 
300h 300h 
DP DP 
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Syntax Direct: [label] LMMR ama, #addr 
Indirect: [label] LMMR {ind}, #adar [,next ARP 


Operands 0s dmas 127 
0 s next ARP s 7 
0 < addr = 65535 
Opcode 
14 11 10 9 


pices Pi ooo 100 spo] Daa Maron Adee 
1 


16-Bit Constant 
15 14 13 12 11109 8 


| ee A = care 2 
Indirect: 


| 16- Bit Constant | 


Execution PFC — MCS 

(PC) +2 — PC 

Ik — PFC 

While (repeat counter = 0): 
(src, addressed by PFC) — (dst, specified by lower 7 bits of dma) 
(PFC) +1 —> PFC 
(repeat counter) - 1 — repeat counter 

MCS — PFC 


Description The memory-mapped register pointed at by the lower 7 bits of the directly or 
indirectly addressed data memory value is loaded with the contents of the data 
memory location addressed by the 16-bit address, addr. The 9 MSBs of the 
data memory address are set to zero, regardless of the current value of the 
data page pointer (DP) or the upper 9 bits of AR(ARP). This instruction allows 
any memory location on data page zero to be accessed without modifying the 
DP field in status register STO. 


When using the LMMR instruction with the RPT instruction, the source ad- 
dress, #addr, is incremented after every memory-mapped load. 


Words 2 


Cycles Direct: [label LMMR ama, #adadr 
Indirect: §[/abel| LMMR {ina}, #adadr [,next ARP 


Cycle Timings for a Single Instruction a 


CE 1 2 
tals an 2+2Pcode 
eos — 2+2Pcode 
PE 
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ee Load Memory-Mapped Register LMMR 


ateatal atasic atatstes seateratVeatatehCatativa atecatocale Heh ¥ ster ecsrersrelseucecnc seas ocpcacote ol egler siete eearstorenstolsle.ctafecstahe ahs Se 8c eiGasterstes GiGi Soot Natta aeta te auetets Chae aty nl LOS SHES E A cieter gals nuslateictslaieiNies cei almate Cage Malthe f state dtectie gletela ta ati titie le WCRI Lseistatatcigtsty Nee at) oN nae Fete Ral ett Sat caailese Cans en ATe Ngee ect Rakin ais oe tan hy 


| Cycle Timings for a Single Instruction (continued) 
ec, ca 2 


- Source Ext 2+Psrc , 2+Psrc 2+Psre 3+Psro+2Pcode 
Destination MMRt 


Source SARAM | | 
Source Ext _ 7] 2ntndere 2ntnderc | 2Qntndse «|. 2N+1+ Ndgret2Peode 
onions fp fe 


Source DARAM 


Source SARAM 3N+NiOgst | 3N+NiOgsz 7 3N+NIOgst 3N+3+NIOgs¢+2Pcode 
Destination MMPORT | 3N+1 +nidges? 


| Source Ext — | 4n—1+ndeet | 4n—1+ndeo+ | 4n—1tndeo+ | 4n+2+ndee+ 
| Destination MMPORT | NlOgst | Nidgst | NiOgst NiOgst+2Pcode 


t If the source operand and the code are in the same SARAM block. 
+ Add one more cycle if peripheral memory mapped register access. 
§ Add n more cycles if peripheral memory mapped register access. 


Source DARAM | | 
Destination MMRS§ 


Example 1 LMMR DBMR,#300h 
Before Instruction | After instruction 
Data Memory Data Memory | 
300h 300h 
DBMR DBMR 
Example 2 LMMR *,#300h,AR4 *CBCR = ILEh 
Before Instruction After Instruction 
ARP =e) ARO 
ARO [seh] «ARO ES= 
Data Memory | Data Memory 
300h 300h 
CBCR CBCR 


Syntax Direct: [label LPH dma 
Indirect: [label LPH {ind} [,next ARF 


Operands 0 <dma sx 127 
0 < next ARP <7 


Opcode 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 # OQ 


Direct; 0 1 1 14 0 #14 0 1 Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 OQ 
Indirect:} O 1 #14 #1 0 #4 #90 4 


Execution (PC) +1 —- PC 
(dma) — P register (81-16) 


Description The P register high-order bits are loaded with the contents of data memory. 
The low-order P register bits are unaffected. 


The LPH instruction can be used for restoring the high-order bits of the P regis- 
ter after interrupts and subroutine calls if automatic context save is not used. 


Words 1 
Cycles Direct: [label] LPH dma 
Indirect: [abe] LPH {ind} [,next ARP] 
Cycle Timings for a Single Instruction 
Operand DARAM 14> 
Operand SARAM 1 1+p 
ot 
Operand Ex 
Cycle Timings for a Repeat (RPT) Execution 
Operand DARAM a ee 
Operand SARAM n n+p 
n+1t 
Operand Ext 
Example 1 LPH DATO ;(DP = 4) 
Before Instruction After instruction 
Data Memory Data Memory 
200h 200h 
P P 
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Load Product High Register LPH 


Example 2 LPH * ,AR6 
Before Instruction After Instruction 


app ARP 
ARS ans 


Data Memory Data Memory 
200h OF79Ch 200h OF79Ch] 
P 30079844h P | OF79C9844h 
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Syntax Direct: [label] LST #n, dma 

Indirect: [label] LST #n, {ind} [,next ARP 
Operands 0 <dmas< 127 

n=0,1 

0 < next ARP <7 
Opcode 

LST #0 


15 14 13 12 11 10 9 8 7 6 § 4 3 2 ij OQ 


Direct!} O O O O 1 1 0 Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 


Indirect:| 0 O O O 1 1 Oj} 1) See Subsection 4.1.2 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 Q 


Direct} 0 0 0 0 1 1 #71 «1 Data Memory Address 


15 14 13 12 11 10 9 8 7 6 § 4 3 2 ji QO 


Indirect} 0 O O 0 14 1 #1 #44 44 See Subsection 4.1.2 


Execution (PC) +1 — PC 
(dma) -—> status register STn 
dma (bits 13-15) —> ARP (regardless of n) 


Affects ARB, ARP, OV, OVM, DP, CNF, TC, SXM, C, HM, XF, and PM. 
Does not affect INTM. 


Description Status register STn is loaded with the addressed data memory value. Note that 
the INTM bit is unaffected by LST #0. In addition, the LST #0 instruction does 
not affect the ARB field in the ST1 register even though a new ARP is loaded. 
Ifa next ARP value is specified via the indirect addressing mode, the specified 
value is ignored. Instead, ARP is loaded with the value contained within the 
addressed data memory word. 


Note: 
When ST1 is loaded, the value loaded into ARB is also loaded into ARP. 
| eee 
The LST instruction can be used for restoring the status registers after subrou- 
tine calls and interrupts. 
Words 


onl, 
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Cycles Direct: [label LST #n, dma 
Indirect: [label] LST #n, {ind} [,next ARP] 


| | Cycle Timings for a Single Instruction 


2+Dcode 


2 2+Pcode 
gt 
Sauce Fete 


Cycle Timings for a Repeat (RPT) Execution 


Source SARAM 


i 
4 
i 


a [See 5 ee fc ee © eee 

_SourceDARAM [fe re 
Source SARAM 2n 2N+Pcode 

aaa a ee 


t If the source operand and the code are in the same SARAM block. 


Example 1 MAR *,ARO 
LST #0,*,AR1 ;The data memory word addressed by the contents 
;0f auxiliary register ARO is loaded into 
;status register ST0O,except for the INTM bit. 
;Note that even though a next ARP value is 
;specified, that value is ignored, and the 
sold ARP is not loaded into the ARB. 


Example 2 LST #0,60h ;(DP = 0) 
Before Instruction After Instruction 
Data Memory Data Memory 
60h 60h 
To STo 
ST1 ST1 
Example 3 LST #0,*—,AR1 
Before Instruction After Instruction 
ARP ae: ARP 
AR4 AR4 
Data Memory Data Memory 
SFFh 3FFh [OEE 04h] 
sTo STo 
ST1 ST 
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Example 4 
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LST 


#1,00h 


Data Memory 
300h 


STO 
ST1 


:(DP = 6) 
Before Instruction 


0E1BCh 
0406h 
O9A0 


After Instruction 


Data Memory | 
i OEIBC 


300h 
STO 
ST1 


E406 


~ OE1BCh] 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Direct: [label] LT dma 
Indirect: [label LT {ina} [,next ARP 


O0<dma sx 127 
0 < next ARP <7 


15 14 13 12 11 10 9 8 7 6 5 4 


TREGO 


3 


2 


1 


0 


Direct} 0 1414 14 14 #0 0 41 1 Data Memory Address 


15 14 13 12 #11 #10 9 8 7 6 § 4 


3 


2 


1 


0 


indiect["0 11 1 0 0 1 1] 1]  SeeSubsection 4.12 


(PC) +1 —- PC 
(dma) — TREGO 
lf TRM = 0: 
(dma) — TREG1 
(dma) — TREG2 


Affected by TRM. 


TREGO is loaded with the contents of the specified data memory address 
(dma). The LT instruction may be used to load TREGO in preparation for multi- 
plication. See the LTA, LTD, LTP, LTS, MPY, MPYA, MPYS,and MPYU instruc- 
tions. If the TRM bit of the PMST register is 0, then TREG1 and TREG2 are 
also loaded to maintain compatibility with the ’C25. The TREGs are memory- 
mapped registers and may be read and written with any instruction that ac- 
cesses data memory. Note that TREG1 is only 5 bits and TREG2is only 4 bits. 


1 


Direct: [label LT dma 
Indirect: [label] LT {ind} [,next ARP| 


Cycle Timings for a Single Instruction 


Operand SARAM 


Operand rs 


Cycle Timings for a Repeat (RPT) Exe 


Operand DARAM 


Operand SARAM 


Operand Ex 


t If the operand and the code are in the same SARAM block. 
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Example 1 LT DAT24 ;(DP = 8. TRM = 1). 
Before Instruction 


Data Memory Data Memory 


“18h aieh 
Teco «= [3h] TREGO 


Example 2 LT *,AR3 ;(TRM = 0) 
Before Instruction 


AR2 AR2 
Data Memory Data Memory 


4i8h 4i8h 


TREGo = [3h] TREGO 
TREGI TREG! 


TREG2 [Sh] TREG2 


After instruction 


After instruction 


| 418 


TL 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Cycles 


Direct: [label LTA dma 
Indirect: [label] LTA {ina} [,next ARF 


0<dmas 127 
0 < next ARP <7 


15 14 13 12 #11 +#+10 =Q 7 6 5 4 3 2 1 =O 


15 14 13 #12 #111 #10 =Q9 7 6 5 4 3 2 1 =O 


Indiect:} 0 1 1 1 0 0 0 0 es See Subsection 4.1.2 


(PC) +1 —- PC 
(dma) — TREGO 
(ACC) + (shifted P register) - ACC 


Affected by OVM, PM, and TRM; affects OV and C. 

TREGO is loaded with the contents of the specified data memory address 
(dma). The contents of the product register, shifted as defined by the PM status 
bits, are added to the accumulator, with the result left in the accumulator. If the 
TRM bit of the PMST register is 0, then TREG1 and TREG2 are loaded with 


the same value as TREGO to maintain compatibility with the C25. Note that 
TREG1 is only 5 bits and TREG2 is only 4 bits. 


The function of the LTA instruction is included in the LTD instruction. 
1 


Direct: [label LTA dma 
Indirect: [label LTA {ind} [,next ARF 


| Cycle Timings for a Single instruction 


een aa hoe Eee 
Operand SARAM 1 1 1 
ot 


Cycle Timings for a Repeat (RPT) Execution 


TPR [POA [PSA] PE 
[OperandDARAM hp 


i 
Operand SARAM n | nep 
n+it 


“Operand Ex 


t If the operand and the code are in the same SARAM block. 
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Example 1 LTA DAT36 ;(DP = 6, PM = 0, TRM = 1) 
Before Instruction 
Data Memory Data Memory 
324h 34h 
TREGO [SH] TREGO 
P P 
ACC ee) ACC 
C 
Example 2 LTA *,5 :(TRM = 0) 
Before Instruction 
ARP ee 
AR4 AR4 
Data Memory Data Memory 
324h 324h 
TREGO [HI TREGO 
TREG1 TREG1 
TREG2 [SH TREG2 
P P 
ACC a) ACC 
Cc 
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Assembly Language Instructions 


After Instruction 


[0] 
C 


After Instruction 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Cycles 


Direct: [label] LTD dma 
Indirect: [label LTD {ina} [,next ARP 


0 <dmas 127 
0 < next ARP <7 


6 5 4 3 2 1 QO 


7 
0 | Data Memory Address 


8 
0 

15 14 13 12 11 #10 9 8 7 4 3 2 1 O 
0 


6 5 
Indirect} O 1 #1 #1 #0 O 1 at See Subsection 4.1.2 


(PC) +1 — PC 

(dma) — TREGO 

(dma) — dma + 1 

(ACC) + (shifted P register) —- ACC 


Affected by OVM, PM, and TRM; affects C and OV. 


15 14 13 12 11 #10 9 
Direct:} 0 1 1 #14 #0 0 1 


TREGO is loaded with the contents of the specified data memory address 
(dma). The contents of the P register, shifted as defined by the PM status bits, 
are added to the accumulator, and the result is placed in the accumulator. The 
contents of the specified data memory address are also copied to the next 
higher data memory address. If the TRM bit of the PMST register is 0, then 
TREG1 and TREG2 are also loaded to maintain compatibility with the ’C25. 
Note that TREG1 is only 5 bits and TREG2 is only 4 bits. 


This instruction is valid for all blocks of on-chip RAM configured as data 
memory. The data move function is continuous across the boundaries of con- 
tiguous blocks of memory but cannot be used with external data memory or 
memory-mapped registers. This function is described under the instruction 
DMOV. Note that if LTD is used with external data memory, its function is identi- 
cal to that of LTA. 


1 


Direct: [label] LTD dma 
Indirect: [/abel LTD {ind} [,next ARF 


Cycie Timings for a Single Instruction | 
[Operana param [TTT Tt 


Operand SARAM 


Operand Ext Bran 


Cycle Timings for a Repeat (RPT) Execution 


Ct a Ce 2 


Operand SARAM 2n-2 2n-2 2n-2+p 
2n+it 
Operand Ext 4n—-2+2nd | 4n-2+2nd | 4n-2+2nd | 4n+1+2nd+p 


t If the operand and the code are in the same SARAM block. 


Example 1 LTD DAT126 ;(DP = 7, PM = 0, TRM = 1). 

Before Instruction After Instruction 

Data Memory Data Memory 
3FEh 3FEh 

Data Memory Data Memory 
3FFh 3FFh 
TEGO = [3h]_—TREGO 
P P 
ACC eT) acc [0] 

Cc Cc 
Example 2 LTD *,AR3 ;(TRM = 0) 

Before instruction After Instruction 
ARP CC ARP a) 
AR1 AR1 [FEN 

Data Memory Data Memory 
SFEh 3FEh 

Data Memory Data Memory 
aFFh aFFh 
TREGO TREGO 
TREG! TREGI 
TREG2 TREG2 
P P OFh 
Ace Cc Face 

Cc Cc 


4-124 | Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Cycles 


Direct: [label LTP dma 
Indirect: [label] LTP {ina} [,next ARP 


0<dma sx 127 
0 < next ARP x 7 


15 14 13 12 11 #10 
Direct:| O 1 1 1 0 O 


6 5 4 3 2 1 9Q 


Data Memory Address 


8 7 
0 1] 0) 

15 14 13 12 1110 9 8 7 6 5 4 3 2 1 90 
0 1] 1] 


Indirect:| O 1 1 1 0 O 1 See Subsection 4.1.2 


(PC) +1 — PC 
(dma) — TREGO 
(shifted P register) -— ACC 


Affected by PM and TRM. 


TREGO is loaded with the contents of the addressed data memory location, 
and the product register is stored in the accumulator. The shift at the output 
of the product register is controlled by the PM status bits. If the TRM bit of the 
PMST register is 0, then TREG1 and TREG2 are also loaded to maintain com- 
patibility with the 'C25. Note that TREG1 is only 5 bits and TREG2is only 4 bits. 


_* 


Direct: [label LTP dma 
Indirect: [label LTP {ind} [,next ARP 


Cycle Timings for a Single Instruction 


Operand DARAM 


[PR POA 
an ce 
[ Operenee ed 


Cycle Timings for a Repeat (RPT) Execution 


ron ran 
opmansaRaT Tr ne ee 
ffm [LP 


Operand DARAM 
n 


Operand SARAM os 


t If the operand and the code are in the same SARAM block. 
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;(DP = 6, PM = 0, TRM = 1) 
Before Instruction 


;(PM = 0, TRM = 0) 


Before Instruction 


After instruction 


After Instruction 


Load TREGO and Subtract Previous Product LTS 


Syntax Direct: [label LTS dma 

Indirect: [label LTS {ina} [,next ARP| 
Operands 0 <dmas 127 

0 < next ARP <7 
Opcode 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 Q 


Direct:} O 1 #1 #1 #O 1 #O Of O| Data Memory Address 


eo 8 7 4 3 
indrect0 4110100] 1] Sea Subsecion aT2 
Execution (PC) +1 —- PC 


(dma) — TREGO 
ACC — (shifted P register) -> ACC 


Affected by PM, TRM, and OVM; affects OV and C. 


Description TREGO is loaded with the contents of the addressed data memory location. 
The contents of the product register, shifted as defined by the contents of the 
PM status bits, are subtracted from the accumulator. The result is placed in the 
accumulator. If the TRM bit of PMST is set to 0, the value is also loaded into 
TREG1 and TREG2 to maintain compatibility with the ‘C25. Note that TREG1 
is only 5 bits and TREG2 is only 4 bits. 


Words 1 


Cycles Direct: [label LTS dma 
Indirect: [label LTS {ina} [,next ARP 


Cycle Timings for a Single Instruction 


Operand DARAM 


a ee ee 
Ee 
ieee A EN 
ee ce 


Operand Ext 


0 0 
a 


Operand SARAM 


Operand Ext 


vend 


t If the operand and the code are in the same SARAM block. 
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LTS Load TREGO and Subtract Previous Product 


Example 1 LTS DAT36 ;(DP = 6, PM = 0, TRM = 1) 


Before Instruction 
Data Memory 


22th 
TEGO DF 
P 


ACC 
C 


Example 2 LTS *,AR2 ;(TRM = 0) 
Before Instruction 


en | 
AR 
22th 


mreco | DH 
TREGI 
mec2 = [3h 

P 
Acc 
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After instruction 


Data Memory | 
324h 
TREGO 
P 
acc [0] 
Cc 


TREGO 
TREG! ah 
TREG2 


Assembly Language Instructions 


RAO OCCUR LOE O ECTS REE OCS 


Syntax 


Operands 


Opcode 


Execution 


Description 


Multiply and Accumulate MAC 


enn atv wat ana aa aaa aaa a a ea aa aaa a aii iittala a i at a aati a aa atee itt aaa laa a fa h aa a a at a aa oa a ia at a a aa  at iat Wat  a a a eta ah l ethati ea e i t 


Direct: [label MAC pma, dma 
Indirect: [label] MAC pma, {ind} [,next ARP] 


0 < pma s 65535 
Osdmas 127 
0 < next ARP <7 


1514 #13 #12 +11 :°#10 9 8 7 6 5 4 3 2 #1 =O 


sic a 0 1 0 0 0 1 +«0 Data Memory Address 
16-Bit Constant 


15 14 13 12 11 :+#10 9 8 7 6 5 4 3 2 1 9 
0 oO 1 


1 0 1 0 0 See Subsection 4.1.2 
16-Bit Constant 


(PC) +2 — PC 
(PFC) — MCS 
(pma) — PFC 


If (repeat counter) = 0: 
Then (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified 
(PFC) + 1 — PFC 
(repeat counter) — 1 -> repeat counter. 
Else (ACC) + (shifted P register) —» ACC, 
(dma) — TREGO 
(dma) x (oma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified 
(MCS) — PFC 


Affected by OVM, TRM, and PM; affects C and OV. 


Indirect: 


The MAC instruction multiplies a data memory value (specified by dma) by a 
program memory value (specified by pma). It also adds the previous product, 
shifted as defined by the PM status bits, to the accumulator. 


The data and program memory locations on the ’C5x may be any nonreserved, 
on-chip or off-chip memory locations. If the program memory is block BO of 
on-chip RAM, then the CNF bit must be set to one. When the MAC instruction 
is used in the direct addressing mode, the dma cannot be modified during rep- 
etition of the instruction. 


When the MAC instruction is repeated, the program memory address con- 
tained in the PFC is incremented by one during its operation. This makes it 
possible to access a series of operands in memory. MAC is useful for long 
sum-of-products operations because it becomes a single-cycle instruction, 
once the RPT pipeline is started. 


4-129 


MAC Multiply and Accumulate 


If the TRM bit of the PMST register is 0, then TREG1 and TREG2 are loaded 
with the same value as TREGO to maintain compatibility with the 'C2x. Note 
that TREG1 and TREG2 are only 5-bit, and 4-bit long, respectively. 


Words 2 


Cycles Direct: [label MAC pma, dma 
Indirect: [label] MAC pma, {ind} [,next ARP| 


Cycle Timings for a Single instruction 


Operand1 DARAM/ROM 3+2Pcode 
Operand2 DARAM 


Operand1 SARAM 
Operand2 DARAM 


Er, 
Operand2 DARAM 

Operand1 DARAM/ROM 3+2Dcode 

Operand1 SARAM 3 3 3 3+2Dcode 

Operand2 SARAM 4t 4t 4t 
Operand2 SARAM 

Operand2 Ext 

Operand1 SARAM 3+dop2 34+dop2 3+dop2 3+dop2+2Pcode 
Operand2 Ext 


Cycle Timings for a Repeat (RPT) Execution 


Operand1 DARAM/ROM | n+2 n+2 n+2 N+2+2Pcode 
Operand2 DARAM 
ADARA wi + it Nit<t+<Pcode 


Arinarandad O ' 
wPoOaHWU! OAT 


a n a 
ire ire ire 


Operand2 DARAM 


Operand1 Ext N+2+NPop7 N+2+NPop7 N+2+NPops N+2+NPop1+2Pcode 
comer | | 
Operand1 DARAM/ROM | n+2 n+2 n+2 N+2+2pcode 
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Cycle Timings for a Repeat (RPT) Execution (Continued) 
n+2 | 


ee 

Operand1 SARAM n+2 n+2 N+2+2Poode 
Operand2 SARAM 2n+2t 2n+2t 2n+2t 
Operand2 SARAM 
Operand2 Ext | | 
| Operand1 SARAM n+2+ndono n+2+ndono “+2+ndop2 N+2+ndono+2Pcode 
eee i i ie 

Operand1 Ext 2N+24+NPopitN | 2N+2+NPop7+N | 2N+2+NPop7+ 2N+2+NPop1 +Ndopo+ 


t If both operands are in the same SARAM block. 


Example 1 MAC OFFO0h,02h ;(DP = 6, PM = QO, CNF = 1) 
Before Instruction After Instruction 
Data Memory Data Memory 
302h 302h 


Program Memory Program Memory 


FFOOh FFOOh 
TREGO TREGO [2h 


P P 
ACC acc [0] 
Cc Cc 

Example 2 MAC OFF00h,*,AR5 ;(PM = 0, CNF = 1) 
Before Instruction After Instruction 
ARP er ee CF 
AR ARS 

Data Memory Data Memory 
302h 302h 23h] 
Program Memory , Program Memory | 

FFOOh FFOOh 
TREGO 45h TREGO [Bh] 
P [____458972h] P 


ACC 723EC41h acc [o] 76975B3h 
Cc Cc 
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Syntax Direct: [label MACD pma, dma 

Indirect: [label] MACD pma, {ind} [,next ARP 
Operands 0 = pma s 65535 

0=<dmas 127 

0 <s next ARP <7 
Opcode 


15 14 13 12 11 10 9 8 7 6 § 4 3 #2 1 0 
Direct: 1 | QO. 1 0 0 O 1 1 0 | Data Memory Address 
) 16-Bit Constant | 


15 14 13 12 1110 9 8 7 6 5 4 3 2 1 +0 
Indirect{_1.9 1 0 O Oo 1 1] 1 See Subsection 4.1.2 
16-Bit Constant 


Execution (PC) +2 — PC 
(PFC) — MCS 
(pma) — PFC 


If (repeat counter) = 0: 
Then (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1—- PFC 
(dma) — (dma) + 1 
(repeat counter) — 1 — repeat counter. 
Else (ACC) + (shifted P register) —» ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register 
(dma) — (dma) + 1 
Modify AR(ARP) and ARP as specified, 
(MCS) —> PFC 


Affected by OVM and PM; affects C and OV. 


Description The MACD instruction multiplies a data memory value (specified by dma) by 
a program memory value (specified by pma). It also adds the previous product, 
shifted as defined by the PM status bits to the accumulator. The data and pro- 
gram Msmory iocations on ine ‘C5x may be any nonreserved, on-cnip or 
off-chip memory locations. If the program memory is block BO of on-chip RAM, 
then the CNF bit must be set to one. When MACD is used in the direct address- 
ing mode, the dma cannot be modified during repetition of the instruction. If 
MACD addresses one of the memory-mapped registers or external memory 
as a data memory location, the effect of the instruction will be that of a MAC 
instruction (see the DMOV instruction description). 
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If the TRM bit of the PMST register is 0, TREG1 and TREG2 are loaded with 
the same value as TREGO to maintain compatibility with the 'C2x. Note that 
TREG1 and TREG2 are only 5 bits and 4 bits long, respectively. 


MACD functions in the same manner as MAC, with the addition of data move 
for on-chip RAM blocks. Otherwise, the effects are the same as for MAC. This 
feature makes MACD useful for applications such as convolution and trans- 
versal filtering. 


When the MACD instruction is repeated, the program memory address con- 
tained in the PFC is incremented by one during its operation. This permits 
accessing a series of operands in memory. When used with RPT, MACD be- 
comes a single-cycle instruction once the RPT pipeline is started. 


Words 2 


Cycles Direct: [label MACD pma, dma 
Indirect: [labe] MACD pma, {ind} [,next ARP| 


Cycle Timings for a Single Instruction 


Operand1 SARAM 3+2Pcode 
Operand2 DARAM 


Operand1 DARAM/ROM 
Operand2 DARAM 


Operand1 Ext 3+Pop1 3+Pop1 3+Pop1 
Operand2 DARAM 
Operand1 DARAM/ROM 


Operand2 SARAM 


Operand1 SARAM 
Operand2 SARAM 


| Operand1 Ext 3+Pop1 3+Pop1 3+Pop1 

Operand1 DARAM/ROM 3+dop2 3+dop2 3+don2 34+dop2+2Pcode 
Operand2 Ext? 

Operand2 Ext? 


Cycle Timings for a Repeat (RPT) Execution 


n+2 | 


Operand1 DARAM/ROM | n+2 n+2 N+2+2Doode 
Operand2 DARAM 
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MACD _ Multiply and Accumulate With Data Move 


_ Cycle Timings for a Repeat (RPT) Execution (Continued) 


n+2 N+2+2Pcode 


N+2+NPop7 N+2+NPop7 


! Operand1 SARAM 
| Operand2 DARAM 


N+2+NPop1+2Pcode 


| Operand1 Ext 
Operand2 DARAM 


Operand1 SARAM 


2N+2Pcode 


2n+2Pcode 


Operand2 SARAM 3nt 


Operand1 Ext | 
Operand2 SARAM 

Operand1 SARAM | N+2+Ndop2 n+2+ndopo | nrBandoge | 
Operand2 Ext! | 


Operand1 Ext 2n+2+npops+N | 2n+2+npopstn | 2n+2+nPopy+n | 2n+2+nPops+Ndono+ 
Operand2 Ext! dop2 dop2 2Pcode 

t If operand2 and code are in the same SARAM block. 

+ If both operands are in the same SARAM block. 

§ If both operands and code are in the same SARAM block. 


1 Data move operation is not performed when operand2 is in external data memory. 


2N+NPop1t+2Pcode 


“N+24+Ndop2+2Pcode 


Example 1 MACD OFFO00h,08h ;(DP = 6, PM = 0, CNF = 1). 
Before Instruction After instruction 

Data Memory Data Memory 
308h [rh] 8h 3h] 

Data Memory Data Memory 
309h 309h 

Program Memory Program Memory 
FFOOh FFOOh 
TREGO TREGO C23] 
P 458972h| P 
ACC iy] f FOIECATHI AC [rl f FeO7ERAI 
ftw ww ell t icant MSR ti ll | Td we L~J l wut beathtelal | 
Cc C 
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Multiply and Accumulate With _MACD 


Example 2 MACD OFFO0Oh,*,AR6 ;(PM = 0, CF = 1) 

Before Instruction After instruction 
ARP Ci (tC ARP [gd 
ARS ARS 

Data Memory Data Memory 
308h =) 

Data Memory Data Memory 
309h 309h a 

Program Memory Program Memory 

FFOOh FFOOh 
TREGO TREGO [23h] 
P P 
ACC aco 


Cc C 


Note: The data move function for MACD can occur only within on-chip data memory RAM 
blocks. 
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Syntax 


Operands 


Opcode 


Execution 


Description 
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Direct: [label MADD dma 

Indirect: [label] MADD {ind} [,next ARF 

0 < dma s 127 

O < next ARP <7 

> ; 7 - 8 7 3. 2 1 Q 
Direct: Ca a a ee Data Memo Address 
i 2 = 12 "| 8 7 

est 1 p10 10 31111  Seosiomn ata 

(PC) +2 — PC 

(PFC) — MCS 


(BMAR) -» PFC 


If (repeat counter) = 0: 
Then (ACC) + (shifted P register) — ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1-—» PFC 
(dma) — (dma) + 1 
(repeat counter) — 1 —> repeat counter. 
Else (ACC) + (shifted P register) —» ACC, 
(dma) —- TREGO 
(dma) x (pma, addressed by PFC) — P register 
(dma) — (dma) + 1 
Modify AR(ARP) and ARP as specified. 
(MCS) — PFC 


Affected by OVM, TRM, and PM; affects C and OV. 


The MADD instruction multiplies a data memory value (specified by the dma) 
by a program memory value. The program memory address is contained in the 
BMAR register; it is not specified by a long immediate constant. This facilitates 
dynamic addressing of coefficient tables. In addition, the previous product, 
shifted as defined by the PM status bits, is added to the accumulator. The data 
and program memory locations on the 'C5x may be any nonreserved, on-chip 
or off-chip memory locations. If the program memory is block BO of on-chip 
RAM, then the CNF bit must be set to one. When the MADD instruction is used 
in direct addressing mode, the dma cannot be modified during repetition of the 
instruction. If MADD addresses one of the memory-mapped registers or exter- 
nal memory as a data memory location, the effect of the instruction is that of 
a MADS instruction (see the DMOV instruction description). 


MADD functions in the same manner as MADS, with the addition of data move 
for on-chip RAM blocks. Otherwise, the effects are the same as for MADS. This 


Assembly Language Instructions 


Multiply and Accumulate With Data Move and Dynamic Addressing MADD 


feature makes MADD useful for applications such as convolution and trans- 
versal filtering. 


If the TRM bit of the PMST register is 0, TREG1 and TREG2 are loaded with 
the same value as TREGO to maintain compatibility with the 'C2x. Note that 
TREG1 and TREG2 are only 5 bits and 4 bits long, respectively. 


When the MADD instruction is repeated, the program memory address con- 
tained in the PFC is incremented by one during its operation. This enables ac- 
cessing a series of operands in memory. When used with RPT, MADD be- 
comes a single-cycle instruction, once the RPT pipeline is started. 


Words 1 


Cycles Direct: [label MADD dma 
Indirect: [label] MADD {ind} [,next ARP 


Cycle Timings for a Single Instruction 


es 0c ee 2, 
Operand1 DARAM/ROM 2+Pcode 
oral a a 
Operand1 SARAM 2+Pcode 
jomanonon | | 
Operand2 DARAM 
Operand1 DARAM/ROM 2+Pcode 
eral a a 
Operand1 SARAM 
Operand2 SARAM 3+Pcode* 


Operand1 Ext 2+Pop1+Pcode 
er ae ee a 

Operand2 Ext! 

Operand2 Ext? | | 

Operand2 Ext? | 


Cycle Timings for a Repeat (RPT) Execution 


So PR PO PSCC CC‘i‘z’ 

Operand1 DARAM/ROM N+1+Pcode 
eed a a a 
Operand1 SARAM N+1+Pcode | 
common | | 
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MADD Multiply and Accumulate With Data Move and Dynamic Addressing 


Cycle Timings for a Repeat (RPT) Execution (Continued) 


Operand2 DARAM 
Operandi DARAM/ROM | 2n-1 2n-1 2n-1 2n—-1+Pcode 
ried aa a Ee 


Operand1 SARAM 2n-1 2n-1 2n—1+Pcode 
Operand2 SARAM 3n-1* 3n-1# 


3n—1* 
Operand1 Ext 2n+1+NPop1+ 
Operand2 Ext! Nope 


t If operand2 and code reside in same SARAM block. 

+ If both operands reside in same SARAM block. 

$ if both operands and code reside in same SARAM block. 

1 Data move operation is not performed when operand2 is in external data memory. 


2n+1+NPop1+ 
Ndop2 


2n+1+NPop7+ 
Ndop2 


2n+14+NPop7+Ndopo+ 
Pcode 


Example 1 MADD DAT7 ;(DP = 6, PM = 0, CNF = 1) 
Before Instruction After Instruction 
Data Memory Data Memory 
307h 307h 
Data Memory Data Memory 
308h 308h 
BMAR OFFOOh BMAR OFFOOh 


TREGO TREGO 


FFOOh C2} FFOOh (2h 

P P 

ACC 723EC41h Acc [0] 76975B3h 
C C 
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Example 2 MADD *,3 ;(PM = 0, CNF = 1) 


Before Instruction After instruction 
ARP [i‘(<‘aWtOtCé«C( ARP a) 
AR2 AR2 
Data Memory Data Memory 
307h 307h 
Data Memory Data Memory 
308h 308h 
BMAR BMAR 
TREGO TREGO 
FFOOh a) FFOOh a) 
P P 
ACG [ 723EC4th) acc [0] 
Cc Cc 


Note: The data move function for MADD can occur only within on-chip data memory RAM 
blocks. 
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Syntax 


Operands 


Opcode 


Execution 


Description 
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Direct: 


Indirect: 


Direct: [label] MADS dma 
Indirect: [label] MADS {ind} [,next ARP 


0 <dmas 127 
0 < next ARP <7 


15 14 13 12 11 #10 9 7 6 5 4 3 2 1 QO 


8 
1 0 1 0 1 0 14 Of 0| Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0O 


1 0 17 0 1 0 1 Of 1] See Subsection 4.1.2 


(PC)+1 —- PC 
(PFC) — MCS 
(BMAR) — PFC 


If (repeat counter) = 0: 
Then (ACC) + (shifted P register) —> ACC, 
(dma) —~ TREGO 
(dma) x (pma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 PFC 
(repeat counter) — 1 — repeat counter. 
Else (ACC) + (shifted P register) —» ACC, 
(dma) — TREGO 
(dma) x (pma, addressed by PFC) — P register, 
Modify AR(ARP) and ARP as specified, 
(MCS) — PFC 


Affected by OVM, TRM, and PM; affects C and OV. 


The MADS instruction multiplies a data memory value (specified by dma) by 
a program memory value (specified by pma). It also adds the previous product, 
shifted as defined by the PM status bits, to the accumulator. The pmais speci- 
fied by the contents of the BMAR register, rather than by along immediate con- 
stant. This allows for dynamic addressing of coefficient tables. 


The data and program memory locations on the ’C5x may be any nonreserved, 
on-chip or off-chip memory locations. If the program memory is block BO of 
on-chip RAM, then the CNF bit must be set to one. When MADS is used in the 
direct addressing mode, the dma cannot be modified during repetition of the 
instruction. 


When the MADS instruction is repeated, the program memory address con- 
tained in the PFC is incremented by one during its operation. This makes it 
possible to access a series of operands in memory. MADS is useful for long 
sum-of-products operations because this instruction becomes a single-cycle 
instruction, once the RPT pipeline is started. 


Assembly Language Instructions 


If the TRM bit of the PMST register is 0, TREG1 and TREG2 are loaded with 
the same value as TREGO to maintain compatibility with the ’C2x. Note that 


TREG1 and TREG2 are only 5 bits and 4 bits long, respectively. 


Words 1 
Cycles Direct: [label] MADS dma 
Indirect: [label MADS {ina} [,next ARP| 


Cycle Timings for a Single Instruction 


Operand1 DARAM/ROM 
Operand2 DARAM 


2+Pcode 


Operand1 SARAM | 
Operand2 DARAM 


Operand2 DARAM 


2+Pcode | 


2+Pcode 
Operand2 SARAM 


Operand 1 SARAM 2 2 2 2+Dcode 
Operand2 SARAM 3t gt 3t 3+Pcode! 
Operand2 SARAM 

Operand! DARAM/ROM | 2+dop2 2+dop2 2+dop2 2+dop2+Pcode 
Operand2 Ext 

Operand1 SARAM 2+don2 2+dop2 2+don2 2+don2+Pcode 
Operand2 Ext 

Operand2 Ext 


Cycle Timings for a Repeat (RPT) Execution | 


Operand1 DARAM/ROM 
Operand2 DARAM 
ie 


| Operand1 SARAM n+1 n 
n+1 


N+1+Pcode 


Operand2 DARAM | 
Operand1 DARAM/ROM n n n+1+Dcode | 
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n+1 n+1 
+1 +1 1 on+1+Pcoge 
| Operand2 DARAM 
+1 +1 ! 


. Operand1 SARAM | n+1+Poode 
Operand2 SARAM 2n+it 
Operandi Ext 


| Operand2 SARAM 

_ | Operand1 DARAM/ROM 

| Operand2 Ext 

| Operand1 SARAM | 

| Operand2 Ext 
Operandi Ext _ 
Operand2 Ext 


t If both operands are in the same SARAM block. 


Example 1 MADS DAT12 ;(DP = 6, PM = 0, CNF = 1). 
Before Instruction After Instruction 
Data Memory | | Data Memory | 

30Ch 30Ch 
BMAR BMAR 
TREGO TREGO 

Program Memory Program Memory 
FFOOh Lh] FFOOH Ch 
P [458972h P 
ACC acc [0] 

Cc Cc 

Example 2 MADS *,AR3 ;(PM = 0, CNF = 1) 

Before Instruction After Instruction 
ARP [J ARP 
AR2 AR2 
Data Memory Data Memory _ 
30Ch 30Ch 
BMAR BMAR [___OFFOOh| 
TREGO a = TREGO [—S—=« 

Program Memory | Program Memory 
FFOOn Fron Ca 
P [458972h p 


ACC 723EC41h acc. [o] 76975B3h 
Cc Cc 
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Syntax 


Operands 
Opcode 


Execution 


Description 


Words 


Cycles 


Example 1 


meee ntiiean eat e ieee aie CMG RPC C EEE EE Een en eae he Tere nC CE ER COTE REN erent ietistna’ Mra teesrrrccratstmstyratitytrerte teeters aarp eee tie tetrtrtraerre riers ateirctatyerrrtatet rien rte ce Patreitrtetescstapta crtrtpteetetrtarretetrrtatrtetetstrtetethrtrtetietetetetyerteraatracettrtrtristitatrtatet teterticetete etary: Aine tree ntentete tye retreats retire rT Terre TAT ETT TT 


MAR 


Direct: [label MAR dma 
Indirect: [label| MAR {ina} [,next ARP 


0 < next ARP <7 


15 14 13 12 #11 £10 9 8 6 3.2 1 =O 


7 5 4 
Direct} 1 0 0 O 1 0 1 #1/]0— Data Memory Address 
7 


15 14 13 12 11 10 +9 8 6 5 4 3 2 1 +O 


Re See Subsection 4.1.2 | 


Indirect: 


(PC) +1 — PC 


Modifies ARP, AR(ARP) as specified by the indirect addressing field. Acts as 
a NOP in direct addressing mode. 


Affected by NDX. 


In the indirect addressing mode, the auxiliary registers and the ARP are modi- 
fied; however, no use is made of the memory being referenced. Note that if the 
NDX bit of the PMST register is 0 and the auxiliary register 0 (ARO) is modified, 
then the ARCR and INDX registers are also modified in the same way to main- 
tain compatibility with the 'C2x. Note that TREG1 and TREG2 are only 5 bits 
and 4 bits long, respectively. MAR modifies the auxiliary registers or the ARP, 
and the old ARP is copied to the ARB field of the status register ST1. Any oper- 
ation that MAR performs can also be performed with any instruction that sup- 
ports indirect addressing. ARP can also be loaded by an LST instruction. The 
instruction LARP from the ’C25 instruction set is a subset of MAR (that is, MAR 
*.4 performs the same function as LARP 4). 


| 


Direct: [label MAR dma 
Indirect: [label] MAR {ind} [,next ARP 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
a ee ee ee 


MAR *,AR1 ;Load the ARP with 1. 
Before Instruction After Instruction 
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Example 2 
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*+,AR5 ;Increment current auxiliary register 
;(AR1) and load ARP with 5. 


Before Instruction After Instruction 
ARI ARI [35h] 
ARP Ct) ARP CS 
ARB a) ARP rae) 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Direct: [label MPY dma 

Indirect: [label MPY {ina} [,next ARP 
Short Immediate: [label] MPY #k 

Long Immediate: [label] MPY #/k 


0<dma <= 127 

0 = next ARP <7 
—4096 =< k < 4095 
—32768 s Ik < 32767 


Multiply data value times TREGO 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #@Q 
0 


Direct:} O 1 0 1 0 1 =O Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 
Indirect} O 1 O 1 O 1 0 Of 1. See Subsection 4.1.2 


Multiply TREGO by 13-bit immediate 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 = 0 


Short:| 1 1 0 13-Bit Constant | 


Multiply TREGO by long immediate 
15 14 #13 #12 #11 :#10 9 8 7 6 § 4 3 2 1 JO 


1 O 1 #1 #1 =1 ~=1 0 1 0 0 0 0 0 0 O 


Long: 
| 16-Bit Constant | 


If indirect or direct addressing: 


lf short immediate value specified: 


(PC)+1 —- PC 
(TREGO) x k — P register 


If long immediate value specified: 


(PC) +2 — PC 
(TREGO) x Ik — P register 


The contents of the TREGO register are multiplied by the contents of the ad- 
dressed data memory location. The result is placed in the P register. Short im- 
mediate addressing multiplies TREGO by a signed 13-bit constant. The short 
immediate value is right-justified and sign-extended before the multiplication, 


regardless of SXM. 
1 (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) 
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MPY Multip/ 


Cycles Direct: [label] MPY dma 
Indirect: [label MPY {ind} [,next ARP 


Cycle Timings for a Single Instruction | | 
Operand DARAM } 


PR 
od 


Operand Ext 


Operand SARAM 


[PR [POA 
[SperenaDaRAM af 


Operand Ex 


t If the operand and the code are in the same SARAM block. 


Short Immediate: [label] MPY #k 
Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


Long Immediate: [label MPY #/k 


Cycle Timings for a Single Instruction 
its (RL rn 
el Ke 


Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


Example 1 MPY DAT13 ;(DP = 8) 
Before Instruction After instruction 
Data Memory Data Memory 
40Dh 40Dh 
TREGO TREGO 
P P 2A 
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Example 2 


Example 3 


Example 4 


AR1 


Data Memory 
40Dh 


TREGO 


MPY #031h 


MPY #01234h 


TREGO 
p 


Before Instruction 


= = 


Before instruction 


| 


Before Instruction 


ARP 
AR1 


Data Memory 
40Dh 


TREGO 
P 


TREGO 


TREGO 


After instruction 


40D 


We 


2A 


After instruction 


| 
> 


After instruction 


| 


2468 
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Syntax Direct: [label MPYA dma 
Indirect: [label MPYA {ind} [,next ARP 


Operands 0 < dma s 127 
0 <s next ARP <7 


Opcode 
6 5 4 3 2 1 QO 


7 
| 07 Data Memory Address 


8 
0 

15 14 13 12 11 10 9 8 7 6 § 4 3 2 jij QO 
0 


Indirect:[ 0 1 0 1 0 0 0 | 1 | See Subsection 4.1.2 


Execution (PC) +1 — PC 
(ACC) + (shifted P register) — ACC 
(TREGO register) x (dma) — P register 


Affected by OVM and PM; affects C and OV. 


15 14 13 12 11 410 
Direct:| O 1 0 1 0 O 


Description The contents of TREGO are multiplied by the contents of the addressed data 
memory location. The result is placed in the P register. The previous product, 
shifted as defined by the PM status bits, is also added to the accumulator. 


Words 


Cycles Direct: [label] MPYA dma 
Indirect: §[label) MPYA {ina} [,next ARF 


on, 


Cycle Timings for a Single Instruction 


Operand SARAM 


Operand Ext 


T If the operand and the code are in the same SARAM block. 
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;(DP = 6, PM = 0) 
Before Instruction 


;(PM = 0) 


Before Instruction 


WU 


After instruction 
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Syntax Direct: [labe} MPYS dma 
Indirect: [label MPYS {ind} [,next ARP 


Operands 0O<dmas 127 
0 < next ARP <7 


Opcode 
15 14 13 12 11°10 9 8 7 
| 0 1 0 1 0 O 1] 0— 
15 14 13 12 11 =#10 Q 

0 


| 87 
fo 1 0 1 0 0 0 1{1) 


6 5 4 3 2 1 Q 
Data Memory Address 


6 5 4 3 2 1:0 


Indirect 

Execution (PC) +1 — PC 
(ACC) — (shifted P register) -- ACC 
(TREGO) x (dma) — P register 
Affected by OVM and PM; affects C and OV. 

Description The contents of TREGO are multiplied by the contents of the addressed data 
memory location. The result is placed in the P register. The previous product, 
shifted as defined by the PM status bits, is also subtracted from the accumula- 
tor, and the result is placed in the accumulator. 

Words 1 

Cycles Direct: [label MPYS dma 


Indirect: [label MPYS {ind} [,next ARP 


Cycle Timings for a Single Instruction 

PPR] PDA | PSA CL 

i 

at | 

a 
PR P 


Fs — 
ee | a 


Operand DARAM 


ae Fe ae 
ee ee 
| n+it 


Operand Ex [rend _[ rend 


t If the operand and the code are in the same SARAM block. 
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Example 1 MPYS DAT13 ;(DP = 6, PM = 0) 
Before Instruction 


Data Memory 
30Dh 


TREGO 


Example 2 MPYS *,AR5 ;(PM = 0) 
Before instruction 
ARP fae eee 
AR4 
Data Memory 
30Dh 
CGH 


TREGO 


Multi oly and Subtract Previous Product 


MPYS 


After instruction 


Data Memory 
30Dh 
TREGO 
p 
acc [4] Cth) 


AR4 


Data Memory 
30Dh 
TREGO 
P 
acc [4] L____ ten 
C 
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Syntax Direct: [label] MPYU dma 
Indirect: [label MPYU {ina} [,next ARP 


Operands 0<dmas 127 
0 = next ARP <7 


Opcode 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 1 @Q 


Direct:| 0 1 OO 1 1 0 140) Data Memory Address 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 °=0 
0 o 1/1] See Subsection 4.1.2 


Indirect:| O | 0 1 1 


Execution (PC) +1 — PC 
Unsigned (TREGO) x unsigned (dma) — P register 


Not affected by SXM. 
Description The unsigned contents of TREGO are multiplied by the unsigned contents of 
the addressed data memory location. The result is placed in the P register. The 


multiplier acts as a signed 17 x 17-bit multiplier for this instruction, with the 
MSB of both operands forced to zero. 


The shifter at the output of the P register will always invoke sign-extension on 
the P register when PM = 3 (right-shift by 6 mode). Therefore, this shift mode 
should not be used if unsigned products are desired. 


The MPYU instruction is particularly useful for computing multiple-precision 
products, such as when multiplying two 32-bit numbers to yield a 64-bit prod- 
uct. 


Words | 


Cycles Direct: [label MPYU dma 
Indirect: [label MPYU {ina} [,next ARP 


Cycle Timings for a Single Instruction 
Operand DARAM 


era Ee eee 
ee ee | 
Operand SARAM | q +p 
ot 
Operand Ext | t4d | tad | 1+d 


a 


Cycle Timings for a Repeat (RPT) Execution 


a 
een 


nnd 


=| 

+ 

awh, 
+ 


T If the operand and the code are in the same SARAM block. 
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Example 1 MPYU DAT16 ;(DP = 4) 
Before Instruction 
Data Memory 
210h [oF FFFH 
TREGO 
P Pecinceeee ae 
Example 2 MPYU ~*,AR6 
Before Instruction 
ARP Ld 
ARS 
Data Memory 
210h 
TREGO 
P |) 


Data Memory 
210h 


TREGO 
p 


ARP 
ARS5 


Data Memory 
210h 


TREGO 


OFFFEOOOt1h 


After Instruction 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Cycles 


Example 1 


Example 2 
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NEG Negate Accumulator 


[abe] NEG 


None 


15 14 #13 #12 #11 £10 9 8 7 6 5 4 3 2 1 °9O 
1 O 1 71 +71 #1 ~ «1 0 0 0 0 0 0 0 1 =O 


(PC) +1 — PC 
(ACC) x -1 - ACC 


Affected by OVM; affects OV and C. 


The contents of the accumulator are replaced with its arithmetic complement 
(twos complement). The OV bit is set when taking the NEG of 80000000h. If 
OVM = 1, the accumulator contents are replaced with 7FFFFFFFh. If OVM = 
O, the result is 80000000h. The carry bit C on the ’C5x is reset to zero by this 
instruction for all nonzero values of the accumulator, and is set to one if the 
accumulator equals zero. 


1 


[label NEG 


CPR PBA PSA OPE 
a A 
A 


NEG ;(OVM = X) 


Before Instruction After instruction 
ACC acc [0] 

C C 

OV OV 

NEG ;(OVM = 0) 

Before Instruction After Instruction 
ACC acc [0] 

C C 

Li] 

OV OV 


Assembly Language Instructions 


Example 3 NEG ;(OVM = 1) 
Before Instruction 


ACC 
C 


OV 


After instruction 


ace (0) ( —7errrermn 
Cc 
ou 


OV 
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NMI Nonmaskable Interrupt 


Syntax [label NMI 
Operands None 
Opcode 


15. 14 #13 :#12 #11 :#10 9 8 7 6 5 4 3 2 #1 #90 
1 #0 1 1 7 1 £1 0 0 1 0 1 0 0 1 =«0 


Execution (PC) +1 — stack 
24h — PC 
1 — INTM 
Not affected by INTM. 

Description This instruction forces the program counter to the nonmaskable interrupt vec- 
tor located at 24h. The instruction has the same affect as a hardware non- 
maskable interrupt. Interrupts are globally disabled (INTM=1). Automatic con- 
text save is not performed. 


Words 1 
Cycles [labe NMI 
Cycle Timings for a Single Instruction 
Cycle Timings for a Repeat (RPT) Execution 
~ Not Repeatable | 
t The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 

Example NMI ;Control is passed to program memory location 24h and 


;PC+l1 is pushed onto the stack 


4-156 Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


No Operation 


[label] NOP 


None 


15 14 +#13~«212~«211:=C 10 =O 9 8 7 UGLCUD lm LCL 
1 O O 0 1 0 1 #1 0 0 0 0 0 0 O O 


(PC) +1 —- PC 


No operation is performed. The NOP instruction affects only the PC. 
The NOP instruction is useful to create pipeline and execution delays. 


pa © 


[label] NOP 


PR POA PSA PEC 
BL 


Cycle Timings for a Repeat (RPT) Execution 
Cs ee ee 


NOP ;No operation is performed. 
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NORM Normalize Contents of Accumulator 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


4-158 


[label] NORM {ing} 


None 


16 14 13 12 11 10 9 8 7 6 § 4 3 2 1 Q 


1 0 1 0 0 0 0 0 | See Subsection 4.1.2 


(PC) +1 — PC 


If (ACC) = 0; 
Then TC — 1; 
Else, if (ACC(381)) XOR (ACC(30)) = 0: 
Then TC — 0, 
(ACC) x 2 > ACC 
Modify AR(ARP) as specified; 
Else TC > 1. 


Affects TC. 


The NORM instruction normalizes a signed number that is contained in the ac- 
cumulator. Normalizing a fixed-point number separates it into a mantissa and 
an exponent. This is done by finding the magnitude of the sign-extended num- 
ber. ACC bit 31 is exclusive-ORed with ACC bit 30 to determine if bit 30 is part 
of the magnitude or part of the sign extension. If they are the same, they are 
both sign bits, and the accumulator is left-shifted to eliminate the extra sign bit. 


The AR(ARP) is modified as specified to generate the magnitude of the expo- 
nent. It is assumed that AR(ARP) is initialized before normalization begins. 
The default modification of the AR(ARP) is an increment. 


Multiple executions of the NORM instruction may be required to completely 
normalize a 32-bit number in the accumulator. Although using NORM with 
RPT does not cause execution of NORM to fall out of the repeat loop automati- 
cally when the normalization is complete, no operation is performed for the re- 
mainder of the repeat loop. Note that NORM functions on both positive and 
negative 2s-complement numbers. 


Assembly Language Instructions 


Cycles 


Example 1 


Example 2 


Example 3 


_Normalize Contents of Accumulator NORM 


[label NORM {ina} 


Cycle Timings for a Single Instruction 
ae ee |e es 
Cycle Timings for a Repeat (RPT) Execution 


Before Instruction After Instruction 


ARP [aaa ARP a: 

AR2 Lot AR2 

ACC OFFFFFOO1 acc. [o] OFFFE002h 
TC TC 


31-Bit Normalization: 


MAR *,AR1 ;Use AR1 to store the exponent. 
LAR AR1,#0h ;Clear out exponent counter. 
LOOP NORM -=  *+ sOne bit is normalized. 


BCND LOOP,NTC ;If TC = 0, magnitude not found yet. 


15-Bit Normalization: 


MAR *, AR1 ;Use AR1 to store the exponent. 
LAR AR1,#0Fh ;Initialize exponent counter. 
RPT #14 315—bit normalization specified (yielding 
3a 4—bit exponent and 16—bit mantissa). 
NORM *— ;NORM automatically stops shifting when first 


;Significant magnitude bit is found, 
;performing NOPs for the remainder of the 
;repeat loops 


The method in Example 2 is used to normalize a 32-bit number and yields a 
5-bit exponent magnitude. The method in Example 3 is used to normalize a 
16-bit number and yields a 4-bit magnitude. If the number requires only a small 
amount of normalization, the Example 2 method may be preferable to the Ex- 
ample 3 method. This is because the loop in Example 2 runs only until normal- 
ization is complete. Example 3 always executes all 15 cycles of the repeat 
loop. Specifically, Example 2 is more efficient if the number requires three or 
less shifts. If the number requires six or more shifts, Example 3 is more effi- 
cient. 
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NORM Normalize Contents of Accumulator 


ee ee ee 
Note: 


The NORM instruction may be used without a specified operand. In that 
case, any comments on the same line as the instruction are interpreted as 
the operand. If the first character is an asterisk *, then the instruction is as- 
sembled as NORM * with no auxiliary register modification taking place 
upon execution. Therefore, Tl recommends that you replace the NORM in- 
structions with NORM *+ when you want the default increment modifica- 


tion. 
ra ae, | 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


OR With DBMR or Long Immediate OPL 


Direct: [label] OPL [#/k,] dma 
Indirect: [label] OPL [#/k,] {ina} [,next ARP 


0<dmas 127 
Ik: 16-bit constant 
0 =< next ARP <7 


OR DBMR contents with data value 
15 14 13 12 11109 8 7 6 5 4 3 2 1 0 


Direct:} 9 1 0 1 1 0 O 1] 0) Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Indirect} 0 1 0 414 #1414 0 0 44/441 See Subsection 4.1.2 


OR long immediate with data value 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 90 


Direct|_O 1 9 1 1 1 0 1 Data Memory Address 
16-Bit Constant 


15_ 14 13 12 11:10 9 8 7 6 5 4 3 2 1 9 


Ik unspecified: 
(PC) +1 — PC 
dma OR (DBMR) — dma 


Ik specified: 


(PC) +2 — PC 
dma OR |k — dma 


Affects TC. 


If along immediate constant is specified, it is ORed with the value at the speci- 
fied data memory address. If the constant is not specified, the second operand 
to the OR operation is the contents of the dynamic bit manipulation register 
(DBMR). The result of the operation is always written back into the data 
memory location specified. The contents of the accumulator are not affected. 
If the result of the OR operation is 0, then the TC bit is set to 1. Otherwise, the 
TC bit is set to 0. 


1 (Long immediate value not specified) 


2 (Long immediate value specified) 
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OPL OR With DBMR or Long Immediate 


Cycles Direct: [abe OPL [#/k,] dma 
Indirect: [label OPL [#!/k,] {ind} [,next ARP 


Operand SARAM 


ea: ea eS BE: 
a a ee 
| 3t 


Operand EX Baap 


Operand Ext 4n-2+2nd | 4n-2+2nd | 4n-2+2nd 4n+1+2nd+p 


t If the operand and the code are in the same SARAM block. 


Direct: [labe] OPL [#/k,] dma 
Indirect: [label OPL [#/k,] {ina} [,next ARP 


Operand Ext 
Cycle Timings for a Repeat (RPT) Execution 
| Operand DARAM 


Operand SARAM 2n-1 2n-1 2n-1 2n-1+2p 
2n+2t | 


Operand Ext 4n—1+2nd | 4n-14+2nd | 4n—-1+2nd | 4n+2+2nd+2p 


t If the operand and the code reside in same SARAM block. 


Example 1 OPL DAT10 ; (DP=6) 
Before instruction After Instruction 
DBMR OFFFOh DBMR OFFFOh 
Data Memory Data Memory 
30Ah 30Ah OFFF 1h] 
Example 2 OPL #0FFFh,DAT10 ; (DP=6) 
Before Instruction After Instruction 
Data Memory Data Memory 
30Ah 30Ah OFFFh 
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Example 3 


AR3 
DBMR 


Data Memory 
300h 


Example 4 OPL #1111h,*,AR3 
ARP 
AR6 


Data Memory 
306h 


Before Instruction 


Before Instruction 


OR With DBMR or Long 


ARP 
ARS 
DBMR 


Data Memory 
300h 


ARP 
AR6 


Data Memory 
306h 


Immediate OPL 


After instruction 


OFO 


rt 


OFF 


After Instruction 


i 


111F 
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OR OR With Accumulator 


Syntax Direct: [label] OR dma 
Indirect: [label OR {ind} [,next ARP 
Long Immediate: [label OR #l/k [,shiffj 


Operands 0 = dma s 127 
0 < next ARP <7 
Ik: 16-bit constant 
0 < shift < 16 


Opcode 
OR accumulator with data value 
15 14 13 12 11109 8 7 6 § 4 3 2 1 QO 


Direct[ 0 1 1 0 1 1 0 1] 0]  DalaMomory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 +1 #0 


Indirect} O 1 #4 O 1 14 O 14] 11] See Subsection 4.1.2 


OR with ACC long immediate with shift 
15 14 13 12 11 10 9 8 7 6 5§ 4 3 2 i Q 


1 0 14 4 4 4 4 4 FT Ft O OD SHFT t 
Long: 
16-Bit Constant 


OR with ACC long immediate with shift of 16 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 i @Q 


1 1 14 74 4 0 4 0 0 0 0 0 717 O 


7 1 0 
ong: 
16-Bit Constant 


Execution Direct or Indirect Addressing: 


(PC) +1 —- PC 
(ACC(15—0)) OR dma — ACC(15—0) 
(ACC(31-16)) — ACC(31—16) 


Immediate Addressing: 


Not affected by SXM. 


Description The accumulator is ORed with the contents of the addressed data memory lo- 
cation or with a left-shifted long immediate value. The result remains in the ac- 
cumulator. All bit positions unoccupied by the data operand are zero-filled, no 
matter what the value of the SXM status bit is. Thus, the high word of the accu- 
muiator is unaffected by this instruction if direct or indirect addressing is used, 
or if immediate addressing is used with a shift of zero. Zeros are shifted into 
the least significant bits of the operand if immediate addressing is used with 
a nonzero shift count. 


Words 


awh, 


(Direct or indirect addressing) 


NO 


(Long immediate addressing) 


4-164 Assembly Language Instructions 


Cycles 


Example 1 


Example 2 


OR With Accumulator OR 


Direct: [label OR dma 
Indirect: [label OR {ind} [,next ARP 
Cycle Timings for a Single Instruction 
Operand DARAM ae ee 
Operand SARAM 1 1 
ot 
Operand Ex 
Cycle Timings for a Repeat (RPT) Execution 
Operand SARAM n n+p 
n+1 
Operand Ext 
t If the operand and the code are in the same SARAM block. 
Long Immediate: [label OR #/k [,shiffj 


Cycle Timings for a Single Instruction 


Not Repeatable 


OR DAT8 ;(DP = 8) 
Before instruction After Instruction 
Data Memory Data Memory 
408h OFOOOh 408h OFOOOh 


ACC | 100002h ACC 10F002h 
Cc Cc 


OR *, ARO 


Before instruction After instruction 


Pla eee ARP aaa) 
ARI ARI 

Data Memory | Data Memory 

300h 300h 

ACC ACC 
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OR OR With Accumulator 


Example 3 OR #08111h,8 
Before Instruction After instruction 
ACC OFFOOOOh acc [x] 
C Cc 


4-166 Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 


[label] ORB 


None 


1514 #13 #12 #11 :+#10 9 8 7 6 5 4 3 2 i @Q 
1 O 1 #1 =#1 1 1 0 0 0 0 1 0 0 1 =#=1 


(PC) +1 —- PC 
(ACC) OR (ACCB) - ACC 


The contents of the accumulator are ORed with the contents of the accumula- 
tor buffer (ACCB). The result is placed in the accumulator. 


1 
[label ORB 


PR POA PSA PE 
ee 
NS 


ORB 
Before Instruction After Instruction 
ACC ACC 
Cc Cc 
ACCB ACCB 
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OUT Output Data to Port 


Syntax Direct: [label OUT dma, PA 
Indirect: [label OUT {ina}, PA [,next ARP 
Operands 0 <dmas 127 
0 < next ARP = 7 
0=< PA= 65535 
Opcode 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 1 0 
virect| 2 0 9 0 1 1 0 OT OT  DataMemory Address 
16-Bit Constant 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 _ 1 0 
direct 2 0 0 0 1 1 0 O[1]|  SeeSubsecionai2 
Execution (PC) +2 — PC 
While (repeat counter) = 0 
Port address — address bus A15—A0 
(dma) — Data bus D15—D0 
Port address + 1 — Port address 
(repeat counter — 1) — (repeat counter) 
(dma) — (port address) 
Description The OUT instruction writes a 16-bit value from a data memory location to the 
specified I/O port. The IS line goes low to indicate an |/O access, and the 
STRB, RW, and READY timings are the same as for an external data memory 
write. Note that port addresses 50h—5Fh are memory-mapped (see subsec- 
tion 5.1.1); the other port addresses are not. 
RPT can be used with the OUT instruction to write consecutive words from 
data memory to I/O space. In the repeat mode, the port address (PA) is increm- 
ented after each access. 
Words 2 
Cycles Direct: [label] OUT dma, PA 


Indirect: [label OUT {ina}, PA [,next ARP 


Cycle Timings for a Single Instruction 


Source SARAM 3+iOgs¢ 3+10gst 3+1Ogs¢ 5+i0gst+2Pcode 
4+idgo¢t 


3+derotiOgst 6+dJsrotiOast+2Pcode 


4-168 Assembly Language Instructions 


ee ea ee cc s—CsisCSCCCOutput Data to Port, OUT 


Cycle Timings for a Repeat (RPT) Execution 


Source SARAM | 3n+nidge; 3N+NiOgst 3N+NiOgst 3N+3+NiOgs++2Pcode 
3n+1+nidgsst 
Source Ext 5n—2+NderotNiOgse | SN—-2+Ndgro+NiOgse | SN—2+Ndgret NiOgse | SN+1+Ndgre+Nidgs¢+ 
2Pcode 


t If the source operand and the code are in the same SARAM block. 


Example 1 OUT DATO,PA7 ;(DP = 4) Output data word stored in data memory 
slocation 200h to peripheral on port address 7. 


Example 2 OUT *,PALS ;Output data word referenced by current auxiliary 
sregister to peripheral on port address 15. 
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Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-170 


PAC § Load Accumulator With Product Register 


[label] PAC 
None 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 14 0 
1 0 1 1 4 141 14 0 0 00 00 01 1 


(PC) +1 — PC 
(shifted P register) —- ACC 


Affected by PM. 


The contents of the P register, shifted as specified by the PM status bits, are 
loaded into the accumulator. 


1 
[label PAC 


Cycle Timings for a Single Instruction | | | 


| Cycle Timings for a — (RPT) Execution 


PAC ;(PM = 0) 


Before Instruction After Instruction 
p P 
ACC [23h] ACC 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


POP 


Pop Top of Stack to Low Accumulator 


[label POP 


None 


15 14 #13 :#12 #11 °#10 9 8 7 6 5 4 3 2 1 0 
1 O 1 1 1 1 1 0 0 0 1 1 O O 1 O 


(PC) +1 — PC 
(TOS) — ACC(15—-0) 
0 — ACC(31-—16) 
Pop stack one level 


The contents of the top of the stack (TOS) are copied to the low accumulator, 
and the stack is popped after the contents are copied. The upper half of the 
accumulator is set to all zeros. 


The hardware stack is last-in, first-out with eight locations. Any time a pop oc- 
curs, every stack value is copied to the next higher stack location, and the top 
value is removed from the stack. After a pop, the bottom two stack words will 
have the same value. Because each stack value is copied, if more than seven 
stack pops (POP, POPD, RETC, RETE, RETI, or RET instructions) occur be- 
fore any pushes occur, all levels of the stack contain the same value. No provi- 
sion exists to check stack underflow. 


1 
[label] POP 


Cycle Timings for a Single Instruction 
Cee eee oe 7 ae 
Cycle Timings for a Repeat (RPT) Execution 


n+p 


POP 
Before instruction After Instruction 
ACC ACC 
Cc Cc 
Stack Stack 
(83h) 
:<) 
3) 
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Syntax Direct: [label POPD dma 
Indirect: [label] POPD {ina} [,next ARP 


Operands 0 <dmas 127 
0 <= next ARP < 7 
Opcode 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 #1 0 
Direct:|} 1 O O O 1 O 1 O | Data Memory Address 
15 14 13 12 11 #10 9 8 7 6 5 4 3 2 #1 0 
Indirect} 1 0 0 0 1 +0 1 +Ojf 1] See Subsection 4.1.2 
Execution (PC) +1 —- PC 
(TOS) — dma 
POP stack one level 
Description The value from the top of the stack is transferred into the data memory location 
specified by the instruction. The values are also popped in the lower seven lo- 
cations of the stack. The stack operation is described in the previous instruc- 
tion, POP. The lowest stack location remains unaffected. No provision exists 
to check stack underflow. 
Words 1 
Cycles Direct: [label POPD dma 


Indirect: [label POPD {ina} [,next ARP 


Cycle Timings for a Single Instruction 


t If the operand and the code are in the same SARAM block. 


4-172 Assembly Language Instructions 


Example 1 POPD DAT10 ;(DP = 8) 
Before Instruction 
Data Memory 
40Ah 7) 
Stack 
75h 
Example 2 POPD *+,AR1 
Before Instruction 
ARP [ar 
ARO 
Data Memory 
300h Ch] 
Stack 
75h 


Data Memory 
40Ah 


Stack 


ARP 
ARO 


Data Memory 
300h 


Stack 
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PSHD Push Data Memory Value Onto Stack 


Syntax Direct: [label PSHD dma 

Indirect: [/abefl PSHD {ina} [,next ARP] 
Operands 0 <dmas 127 

0 <= next ARP <7 
Opcode 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 QO 


Direct 1 1 1 0 1 1 01 Data Memory Address 


15 14 13 #12 #11 #10 5 4 3 2 


\<e) 
foe) 
N 
o>) 
_ 
© 


Indirect} 0 141 #1 #4 +0 14 +14 =«0f 11 See Subsection 4.1.2 
Execution (dma) — TOS 
(PC) +1 — PC 


Push all stack locations down one level. 


Description The value from the data memory location specified by the instruction is trans- 
ferred to the top of the stack. The values are also pushed down in the lower 
seven locations of the stack, as described in the PUSH instruction. The lowest 
stack location is lost. 


Words | 


Cycles Direct: [label PSHD dma 
Indirect: [labef| PSHD {ind} [,next ARP 


Cycle Timings for a Single Instruction 


1+p 


Operand SARAM 


LPR | PDA 
Operand DARAM a 


Operand i 


Cycle Timings for a Repeat (RPT) Execution 


[PR [POA [PSA [PE 
Operand DARAM 0 


Operand SARAM n n+p 
n+1t 
Operand Ex 


| if the operand and the code are in the same SARAM block. 


4-174 Assembly Language Instructions 


Example 1 PSHD 


Data Memory 
1FFh 


Stack 


Example 2 PSHD *,AR1 
ARP 
ARO 


Data Memory 
1FFh 


Stack 


DAT127 ;(DP = 3) 


Before instruction 


UU 


Before Instruction 


1FF 


UU 


Push Data Memory Value Onto Stack _PSHD 


Data Memory 
1FFh 


Stack 


ARP 
ARO 


Data Memory 
1FFh 


Stack 


After instruction 


tN 


After instruction 


1FF 


IU 


4-175 


PUSH Push Low Accumulator Onto Stack 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-176 


[label PUSH 


None 


15 14 #13 #12 +#11:#10 9 8 7 6 5 4 3 2 1 0 
tT O 1 14 4 4 4+ O O0 OO 4 4 4 1 +O OD 


(PC) +1 —- PC 
Push all stack locations down one level 
ACC(15—-0) — TOS 


The contents of the lower half of the accumulator are copied onto the top of 
the hardware stack. The stack is pushed down before the accumulator value 
is copied. 


The hardware stack is last-in,first-out with eight locations. If more than eight 
pushes (due to CALA, CALL, CC, PSHD, PUSH, TRAP, INTR, and NMI in- 
structions) occur before a pop, the first data values written will be lost with each 
succeeding push. 


1 
[label] PUSH 


Cycle Timings for a Single Instruction 


PUSH 
Before Instruction After instruction 
ACC ACC 
C Cc 
Stack a) Stack 
Ch] 
LC 12h] 
LHI ee) 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


[label RET[D| 


15.14 13 #12 ~#11:°#10 9 8 7 6 5 4 3 2 1 #9O 


15 (14 #13 ~#«12 «11 +#10 «9 8 7 6 5 4 3 2 1 #90 
1 1 1 1 #1 1 1 1 O 0 0 O08 0 O0 OO QO 


(TOS) — PC 
Pop stack one level. 


The contents of the top stack register are copied into the program counter. The 
stack is then popped one level. RET is used with CALA, CALL, and CC for sub- 
routines. The two one-word instructions or one two-word instruction following 
the RET instruction are fetched and executed before the execution of the re- 
turn, if the delayed version is specified with the “D” suffix. 


1 


[label RET 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


t The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


[label] RETD 


Cycle Timings for a Single Instruction 
hee |e eee 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 
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RET Return From Subroutine 


Example 1 RET 
PC 
Stack 
Example 2 RETD 
MAR *,4 
LACC #1h 
PC 
ARP 
ACC 
Stack 
4-178 


Before instruction 


Before Instruction 


After Instruction 
PC 
Stack 


wo ~ 
= J J | 


After Instruction 
PC 37 


ARP 
ACC 
Stack 


It 


75 


zs 


SF 


i 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Return Conditionally RETC 


[label RETC [D| [cond] [, cona2] [,...] 


Conditions: ACC=0 EQ 
ACC#0 NEQ 
ACC<0 LT 
ACCs0 LEQ 
ACC>0 GT 
ACC20 GEQ 
C= NC 
C=1 C 
OV=0 NOV 
OV=1 OV 
BIO low BIO 
TC= NTC 
TC=1 TC 
Unconditional UNC 

RETC: 


15 14 #13 :#12 ~#«11:°#10 9 8 7 6 5 4 3 2 1 0 


1 4 #1 0 141 =«14 ~TRT ZLVC t ZLVC 1 


RETCD: 
15 14 #13 #12 +11 °+#10 9 8 


7 6 5 4 3 2 1 O 
1 4 #4 #14 «4 ~«47 *TRT | ZLVC t ZLVC Tt 


If (Condition(s)) then 
(TOS) — PC 
Pop stack one level. 
Else, continue 


A standard return, RET, is executed if the specified conditions are met. Note 
that not all combinations of conditions are meaningful. The two one-word in- 
structions or one two-word instruction following the RETC are fetched and ex- 
ecuted before the execution of the return, if the delayed version is specified 
with the “D” suffix. If the delayed instruction is specified, the two instruction 
words following the RETCD instruction have no effect on the conditions being 
tested. 


1 
[label RETC [cond] [, cona2] [,...] 


Cycle Timings for a Single Instruction 
[Gondiionstue ee 
Condition False es ee 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 
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RETC Return Conditionall 


[label RETCD [cond7] [, cona2] [,...] 


Cycle Timings for a Single Instruction 


Condition False 


Cycle Timings for a Repeat (RPT) Execution © 
Not Repeatable 


t The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


Example 1 
RETC GEQ,NOV ;A return, RET, is executed if the 
saccummulator contents are positive and the 
°OV bit is a zero. 
Example 2 
RETCD C sA return, RET, is executed if the carry 
MAR* , 4 sbit is set. The two instructions following 
LARAR3 , #1h sthe return instruction are executed 


sbefore the return is taken. 


4-180 Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


io interrupts and Return From interrupt 


W~RETE 


[label RETE 


None 


15_ 14 #13 #12 :#11 ~+#10 9 8 7 6 5 4 3 2 +1 =O 
1 O 1 #14 1 1 1 0 0 0 14 14 14 O FT O 


(TOS) — PC 
Pop stack one level. 
0 — global interrupt enable (INTM bit in STO) 


The contents of the top stack register are copied into the program counter. The 
stack is then popped one level. RETE automatically clears the global interrupt 
enable bit to O (INTM in STO) and pops the shadow register values (see RETI 
description). RETE is the equivalent of setting the INTM bit to 0 and executing 
a RETI instruction. 


1 
[label RETE 


Cycle Timings for a Single Instruction 


Not Repeatable 


t The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


RETE 
Before Instruction After Instruction 
PC PC 
STO STO 
Stack Stack 
75h Cth 
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RET! Return From Interrupt 


Syntax [label RETI 
Operands None 
Opcode 


15 14°+=«13 =~ 12 — C11 10 9 ~ 87 UGE LCUDG eK LCUDT LULL 
1 O 141 141 4 4 4 0 0 0 14 14 4 0 0 0 


Execution (TOS) — PC 
Pop stack one level. 


Description The contents of the top stack register are copied into the program counter. The 
RETI instruction also pops the values in the shadow registers (stored when an 
interrupt was taken) back into their corresponding strategic registers. The fol- 
lowing registers are shadowed: ACC, ACCB, PREG, STO, ST1, PMST, ARCR, 
INDX, TREGO, TREG1, and TREG2. The XF bit in status register ST1 is not 
saved or restored to/from the shadow registers during interrupt service rou- 


tines. 
Words 1 
Cycles [label] RETI 


Cycle Timings for a Single Instruction 


Not Repeatable 


t The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


Example RETI 
Before Instruction After Instruction 


PC PC 
Stack Stack 


75h [2th] 
eel 
ee 3) 
CH [CSE 


4-182 | Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


.ftotate Accumulator Left ROL 


[label ROL 


None 


15.14 #13 #12 #11:#10 9 8 7 6 5 4 3 2 #1 #90 
1 O 1 1 #1 #1 1 0 0 0 0 0 1 +1 +0 #0 


(PC) +1 — PC 

C — ACC(0) 

(ACC(31)) ~ C 
(ACC(380-0)) — ACC(31—1) 
Affects C. 

Not affected by SXM. 


The ROL instruction rotates the accumulator left one bit. The MSB is shifted 
into the carry bit, and the value of the carry bit from before the execution of the 
instruction is shifted into the LSB. 


1 
[label ROL 


Cycle Timings for a Single Instruction __ | 


Cycle Timings for a Repeat (RPT) Execution 
Cs Ls Mc 0, 


ROL 
Before Instruction After Instruction 


acc {o] 0B0001234h acc [1] 60002468h] 
Cc Cc 
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ROLB Rotate ACCB and Accumulator Left 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-184 


[label] ROLB 


None 


156 14 #13 :#12 #11':#10 9 8 7 6 5 4 3 2 1 O 
1 O 1 4 4 74 4 0 0 0 0 4 0 4 0 0 


(PC) +1 — PC 

C — ACCB(0) 

(ACCB(30-0)) -» ACCB(31—1) 
(ACCB(31)) —- ACC(0) 
(ACC(30-0)) -» ACC(31-1) 
(ACC(31)) + C 


Affects C. 
Not affected by SXM. 


The ROLB instruction causes a 65-bit rotation. The contents of both the accu- 
mulator (ACC) and accumulator buffer (ACCB) are rotated to the left by one 
bit. The MSB of the original contents in the accumulator shifts into the carry 
position. The original value of the carry bit (C) shifts into the LSB position of 
the accumulator buffer, and the MSB of the original contents of the accumula- 
tor buffer shifts into the LSB position of the accumulator. 


1 


[label] ROLB 


Cycle Timings for a Single Instruction 


ROLB 
Before instruction After Instruction 
acc [1] acc [9] 
Cc Cc 
ACCB ACCB 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


LO AAA A Ao AN AA A EEE LET RENNER OILERS IRI SR ELL D IEEE LEE SLI ILS RIESE REL ED 


Rotate Accumulator Right ROR 


Fal atet atten Ce ee Pa a gee EEE EOE EEO RIDE OEE EOE OORT DEE 


[label] ROR 


None 


15.14 :(13 #12 #i1=:10 9 8 7 6 5 4 3 2 Ji 0 
1 O 1 1 1 1 1 0 0 0 0 0 1 1 oO 1 


(PC) +1 — PC 
C — ACC(31) 
(ACC(0)) + C 
(ACC(31-1)) -» ACC(30-0) 


Affects C. 
Not affected by SXM. 


The ROR instruction rotates the accumulator right one bit. The LSB is shifted 
into the carry bit, and the value of the carry bit from before the execution of the 
instruction is shifted into the MSB. 


1 
[label ROR 


Cycle Timings for a Single Instruction 


a 
OO 


ROR 


Before Instruction After Instruction 


acc [o] 0B0001235h ACC 5800091Ah 
C C 
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Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-186 


RORB Rotate ACCB and Accumulator Right 


[label RORB 


None 


15 14 #13 :#12 11:10 9 8 7 6 5 4 3 2 +#1 «9 
1 O 14 14 4 4 4 0 0 0 0 4 0 4 0 1 


(PC) +1 — PC 
C — ACC(31) 

(ACC(31-1)) -» ACC(30-0) 
(ACC(0)) -» ACCB(31) 
(ACCB(31-1)) - ACCB(30-0) 
(ACCB(0)) > C 


Affects C. 
Not affected by SXM. 


The RORB instruction causes a 65-bit rotation. The contents of both the accu- 
mulator (ACC) and accumulator buffer (ACCB) are rotated to the right by one 
bit. The LSB of the original contents in the accumulator buffer shifts into the 
Carry position. The original value of the carry bit (C) shifts into the MSB position 
of the accumulator, and the LSB of the original contents of the accumulator 
shifts into the MSB position of the accumulator buffer. 


1 
[label RORB 


Cycle Timings for a Single Instruction 
ee ee eee i er 
Cycle Timings for a Repeat (RPT) Execution 


RORB 
Before Instruction After Instruction 
acc [1] acc [0] 
C Cc 
ACCB ACCB 


Assembly Language Instructions 


Repeat Next Instruction RPT 


SRS SS SSS SOROS OOOO RF OO IIE VIS AIA IID 


Pr ere ree rere er cre reiterate pret per ret tered tate ttneyn tat tree tice trate rere tater re et telat tr ccrrter eiterrtirrereies teeadesterttay ete tcetsnetersaa tency tren trie trent ote iter eer nent ann caeien eieetaee 


Syntax Direct: [label RPT dma 
Indirect: [label RPT {ind} [,next ARP 
Short Immediate: [label] RPT #k 
Long Immediate: [label RPT #/k 


Operands 0s dma < 127 
0 <s next ARP <7 
0O<ks 255 
0 s Ik s 65535 
Opcode 


Repeat next instruction 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 #1 0 


Direct} 0 0 0 0 1 0 1 #1{ 0) Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 +1 =O 


Indirect:| 0 0 0 0 1 O 1 #1] 11 See Subsection 4.1.2 


Repeat next instruction specified by long immediate 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 1 9 
1 O 1 #71 1 1 i 0 1 #1 O OO OO 1 DO ODO 


Long: . 
| 16-Bit Constant 


Repeat next instruction specified by short immediate 
15 14 13 12 11:10 9 8 7 6 5 4 3 2 1 O 


Short:| 1 0 1 1 1 0. 861 1 8-Bit Constant 


Execution Direct or Indirect Addressing: 
PC)+1—- PC 
dma) — RPTC 


Pee te 


Short Immediate Addressing: 
PC) +1 —- PC 
k — RPTC 


-~ 


Long Immediate Addressing: 
(PC) +2 — PC 
Ik — RPTC 


Description The repeat counter (RPTC) is loaded with the addressed data memory loca- 
tion if direct or indirect addressing is used, an 8-bit immediate value if short 
immediate addressing is used, or a 16-bit immediate value if long immediate 
addressing is used. The instruction following the RPT is repeated n times, 
where nis one more than the initial value of the RPTC. Since the RPTC cannot 
be saved during a context switch, repeat loops are regarded as multicycle in- 
structions and are not interruptible. However, the processor can halt a repeat 
loop in response to an external HOLD signal. The execution restarts when 
HOLD/HOLDA are deasserted. The RPTC is set to zero on a device reset. 


4-187 


Words 


Cycles 


Example 1 


4-188 


RPT is especially useful for block moves, multiply-accumulates, normaliza- 
tion, and other functions. The repeat instruction itself is not repeatable. 


1 (Direct, indirect, or short immediate addressing) 
2 (Long immediate addressing) 


Direct: [label] RPT dma 
Indirect: [label RPT {ind} [,next ARP 


Cycle Timings for a Single Instruction 


Operand DARAM 


Operand SARAM 


1 
Operand ex zx 


Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


t If the operand and the code are in the same SARAM biock. 


Short Immediate: [label RPT #k 


Cycle Timings for a Single Instruction 
ln <n Cs | ee 
Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


Long Immediate: [/abel] RPT #/k 


Cycle Timings for a Single Instruction 
ir <a ee i ee 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable | 


RPT  DAT127 ;(DP = 31) 


Before Instruction After Instruction 

Data Memory Data Memory 
OFFFh OFFFh 
RPTC RPTC 


Assembly Language Instructions 


Example 2 


Example 3 


Example 4 


RPT 


RPT 


Before Instruction 


a a 
ARO ARO 


Data Memory Data Memory 
300h OFFFh 300h 
RPTC RPTC 


#1 ;Repeat next instruction 2 times. 
Before Instruction 


RPTC [on] SP TC 


#1111h ;Repeat next instruction 4370 times. 
Before Instruction 


RPTC RPTC 


After Instruction 


300 


OFFF 
OFFF 


After Instruction 


—_ 
Zz 


After Instruction 
1111h 


4-189 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-190 


[label] RPTB pma 
O<pma s 65535 


1 — BRAF 
PC+2 —» PASR 
pma — PAER 


The RPTB instruction allows a block of instructions to be repeated a number 
of times specified by the memory-mapped block repeat count register (BRCR) 
without any penalty for looping. The BRCR must be loaded before execution 
of an RPTB instruction. When the RPTB is executed, the start and end address 
pointers PASR and PAER are loaded with PC+2 and pma, respectively. The 
block-repeat-active status bit (BRAF) is set to one. Block repeat can be deacti- 
vated by clearing the BRAF bit. The number of loop iterations is given by 
(BRCR) + 1. 


The RPTB instruction is interruptible. However, RPTB instructions cannot be 
nested unless the BRCR, PAER, and PASR registers are appropriately saved 
and restored and the block repeat active flag (BRAF) is properly set. Single-in- 
struction repeat loops (RPT, RPTZ) can be included as part of RPTB blocks. 


2 
[label] RPTB pma 


Cycle Timings for a Single Instruction _ 


PE a) 
aaien i aE 


Cycie Timings for a Repeat (RPT) Execution 


Not Repeatable 


SPLK #iterations_ minus _ 1,BRCR;initialize BRCR 
RPTB end_block — 1 
LACC DAT1 
ADD DAT2 
SACL DAT1 
end_block 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


Long Immediate: {label RPTZ #/k 


0 <s|Ik=s 65535 


15 14 13 12 11:10 9 8 7 6 5 4 3 2 1 #0 
1 O 1 141 1 #1 +1 ~=«:0 1 «4 0 0 0 1 +O 1 


16-Bit Constant 


(PC) +1 — PC 
Ik - RPTC 


The RPTZ instruction clears the accumulator and product register and repeats 
the instruction following the RPTZ ntimes, where n= /k+7. RPTZ is equivalent 
to the following instruction sequence: 


MPY #0 
PAC 
RPT #<1lk> 


2 
Long Immediate: [label] RPTZ #/k 


Cycle Timings for a Single Instruction 
[ae ie i Ec 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


RPTZ #7FFh ;Zero product register and accumulator. 
MACD-_ pma, *+ ;Repeat MACD 2048 times. 
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SACB _ Store Accumulator in ACCB 


Syntax 


Operands 
Opcode 


Execution 


Description 
Words 
Cycles 


Example 


4-192 


[label] SACB 
None 


15 14°13 #12 ~+11':+#10 9 8 7 6 5 4 3 2 i 0 
1 O 1 14 141 #1 #=1«~=40 0 0 0 1 +1 ~=«14 ~«1 ~«0 


(PC) +1 —- PC 
(ACC) + ACCB 


The accumulator contents are copied to the accumulator buffer (ACCB). 
1 
[label SACB 


Cycle Timings for a Single Instruction 


SACB 
Before instruction After Instruction 
ACC ACC 
ACCB ACCB 


Assembly Language Instructions 


einrae rentecaterereretarceeteteneenetrrate Penstetetcttetttatstettietatetetrtatetsteretreteteetacntcistatetsteratetrtstnnena 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Cycles 


Cee nn an nani a an nnn on a ne oo on ar a nt a a a No nO I aN I OO I I I I IS RTL APRA NT ASOT IAI INS SOI ISTIC OTIS ID IIOID 


Store High Accumulator With Shift SACH 


OAR LOL SII 


Direct: [label SACH dma |[,shiffj 
Indirect: [labef SACH {ind} [,shiffl,next ARP] 


0 <dmas 127 
0 < next ARP <7 
Osshift<7 (defaults to 0) 


15 14 = = a 10 9 5 4 3 2 1 0 
Direct 1 0011 | SAF [0] Data Memory Adress 
15 14 13 12 11 #10 =@Q 6 5 4 3 2 1 


(coo tt] SET [1] See Subeecion 41. 


tT See Section 4.5. 


(PC)+1 — PC 
[(ACC) x 2shift] — dma 


Not affected by SXM 


The SACH instruction copies the entire accumulator into a shifter, where it left- 
shifts the entire 32-bit number from 0 to 7 bits. It then copies the upper 16 bits 
of the shifted value into data memory. The accumulator itself remains unaf- 
fected. 


1 


Direct: [label] SACH dma |[{,shiff| 
Indirect: [label SACH {ind} [,shiff{,next ARP] 


Pn 
La 
at | 


Operand DARAM 


Operand SARAM. n | n+p 
n+2t 
Dperand SEE 


t If the operand and the code are in the same SARAM block. 
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Example 1 SACH DAT10,1 


ACC 4208001h 
Cc 


Data Memory 
20Ah 


Example 2 SACH *+,0,AR2 


AR1 


;(DP = 4) 


Before Instruction 


Before Instruction 


300h 


ACC 4208001h 
Cc 


Data Memory 
300h 


4-194 


After Instruction 


ACC 4208001h 
Cc 


Data Memory 
20Ah 0841h 


After Instruction 


ARP aa) 

ARI 

ACC 4208001h 
Cc 


Data Memory 
300h 0420h 


Assembly Language Instructions 


Store Low Accumulator With Shift SACL 


Syntax Direct: [label SACL dma |[,shiffj 
Indirect: [label SACL {ind} [,shiffl,next ARF] 

Operands 0 <dmas 127 
0 <= next ARP <7 
O<shift<7 (defaults to 0) 

Opcode 

15. (14°13 #12 ~#11:~#210 9 8 7 6 5 4 3 2 +1 =O; 
Direct: 1 0 0 1 O] SHFT [OJ]  DataMemoryAddress 
15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 =O 
Indirect! 1 0 0 1 O] SHE? [1]  SeeSubsection4i2 
t See Section 4.5. 
Execution (PC) +1 — PC . 
16 LSBs of [(ACC) x 2shift] = dma 
Not affected by SXM. 

Description The low-order bits of the accumulator are shifted left from 0 to 7 bits, as speci- 
fied by the shift code, and stored in data memory. The low-order bits are filled 
with zeros on the shift, and the high-order bits are lost. The accumulator itself 
remains unaffected. 

Words 1 

Cycles Direct: [label] SACL dma |[,shiff 


Indirect: [label SACL {ind} [,shifij,next ARF] 


Cycle Timings for a Single Instruction 


| Operand DARAM 
Operand SARAM 


T If the operand and the code are in the same SARAM block. 
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Example 1 SACL DAT11,1 ;(DP = 4) 


c 


Before Instruction After Instruction 
ACC ACC 
C C 
Data Memory Data Memory 
20Bh 20Bh 
Example 2 SACL *,0,AR7 
Before Instruction After Instruction 


ARP Cd ARP 

ARG AR6 

ACC OOFF 8421h ACC OOFF 8421h 
Cc Cc 


Data Memory | Data Memory 


300h 300h 


4-196 Assembly Language Instructions 


Store Accumulator in Memory-Mapped Register SAMM 


SOLLSISSSESSSS ORS SESSES SSS SESS SESE I SN,  < <__ cK“ °c <_L oS a I I CCS SSSI SSS ESI 


Syntax Direct: [label SAMM dma 
Indirect: [label] SAMM {ind} [, next ARP 


Operands 0sdmas 127 
O <s next ARP <7 


Opcode 
6 5 4 3 2 1 JO 


7 
| 0 | Data Memory Address 
7 


6 5 4 3 2 1 OQ 


15 14 13 #12 11 #10 9 8 
Direct!} 1 0 O O 14 0 0 0 
8 

0 


15 14 13 12 #%111 =#10 «9 


Execution (PC) +1 — PC 
(ACC) — dma(0-7) 


Description The low word of the accumulator is copied to the addressed memory-mapped 
register. The upper 9 bits of the data address are set to zero, regardless of the 
current value of DP or the upper 9 bits of AR(ARP). This instruction allows the 
accumulator to be stored to any memory location on data page 0 without modi- 
fying the DP field in status register STO. 


Words | 


Cycles Direct: [label SAMM dma 
Indirect: [label SAMM {ind} [, next ARP 


Cycle Timings for a Single Instruction 


~ Cycle Timings for a Repeat (RPT) Execution 
Opwrananne [a ne fe 
Operard MMPORT rio 


t Add one more cycle if source is a peripheral memory mapped register. 
+ Add nmore cycles if source is a peripheral memory mapped register. 


Example 1 SAMM PRD ;(DP = 6) 
Before Instruction After instruction 
ACC acc [___ 80h 
PRD PRD 
Data Memory Data Memory 
325h 325h 
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SAMM _ Store Accumulator in Memory-Mapped Register 


Example 2 SAMM *,AR2 ;(BMAR = 1Fh) 

Before Instruction 
ARP 
AR7 
ACC [08 0h 
BMAR 

Data Memory 

a cL) 
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ARP 

AR7 

ACC 
BMAR 


Data Memory 
31Fh 


After Instruction 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Direct: [label SAR AR, dma 
Indirect: [label] SAR AR,{ind} [,next ARP 


0<dmas 127 
O< AR=x<7 
0 < next ARP <7 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 +1 =O 


Direct:} 1 0 0 0 O Data Memory Address 


15 14 13 #12 +d £10 9 8 7 6 5 4 3 #2 1 
0 O 0 


0 
Indirect: See Subsection 4.1.2 


t See Section 4.5. 


(PC) +1 — PC 
(AR) — dma 


The contents of the designated auxiliary register (ARx) are stored in the ad- 
dressed data memory location. When the contents of the current auxiliary reg- 
ister are modified in the indirect addressing mode, SAR ARn (when n = ARP) 
stores the value of the auxiliary register contents before it is incremented, de- 
cremented, or indexed by INDX. 


wh, 


Direct: [label] SAR AR, dma 
Indirect: [label SAR AAR,{ind} [,next ARP 


Cycle Timings for a Single Instruction 
Ope 4p 
1 
ot 
n 


Operand DARAM 


po fee 
“oe 
iS fC oe ie ee 
Re a a (RS Ca (Eee (a 
prensa fe 
n+ 


Operand EX ane 2dep 


t If the operand and the code are in the same SARAM block. 


SAR  ARO,DAT30 ;(DP = 6) 


Before instruction After Instruction 

ARO ARO 
Data Memory Data Memory 

31Eh 31Eh 
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Example 2 SAR ARO, *+ 


Before Instruction After Instruction 

ARO ARO 402h 
Data Memory Data Memory 

401h 401h _____ 40th 


4-200 Assembly Language Instructions 


Syntax 
Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 


SATH 


SATH 


None 


15 14 13 12 11:#10 9 8 7 6 5 4 3 2 +1 =O 
1 O 1 1 1 1 14 0 0 14 #0 1 1 O 71 O 


(PC) +1 —- PC 
16 x (TREG1(4)) — count 
(ACC) right-shifted by count — ACC 


Affected by SXM. 


The accumulator is barrel-shifted right by 16 bits if bit 4 of TREG1 is a one. If 
bit 4 of TREG1 is a zero, the accumulator is unaffected. Zeros are shifted in 
if SXM=0. Copies of ACC(31) are shifted in if SXM=1. The SATH instruction 
in conjunction with the SATL instruction allows a 2-cycle 0- to 31-bit right shift. 
The carry bit is unaffected. 


Before Instruction After Instruction 


ACC OFFFFO000h ACC OOOOFFFFh 
Cc Cc 


TREG1 TREG1 


SATH ;(SXM = 1) 
Before Instruction After Instruction 


ACC OFFFFOO00h ACC OFFFFFFFFh 
Cc Cc 
mest = [xis] TREG 
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Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


4-202 


SATL 


None 


1514 #13 #12 11 :#10 9 8 7 5 2 0 
1 O 1 71 = 1 1 1 0 0 14 0 4 4 0 714 =O 


(PC) +1 — PC 
(TREG1(3—-0)) — count 
(ACC) right-shifted by count - ACC 


Affected by SXM. 


The accumulator is barrel-shifted right by the value specified in the 4 LSBs of 
TREG1. Zeros are shifted in if SXM=0. Copies of ACC(31) are shifted in if 
SXM=1. The SATL instruction in conjunction with the SATH instruction allows 
a 2-cycle 0- to 31-bit right shift. The carry bit is unaffected. 


1 
SATL 


PR POAT PSA PE 
a 
0 


SATL ;(SXM = 0) 


Before Instruction After instruction 


ACC OFFFFO000h ACC 3FFFCO0Oh 
Cc C 
TREG1 TREG1 CO x2] 


SATL ;(SXM = 1) 
Before Instruction After Instruction 


ACC OFFFFOO000h ACC OFFFFC000h 
Cc Cc 
meGt = (RTH = 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


[label SBB 


None 


15 14 :+#13 «#12 ~«11:+#«10 9 8 7 6 5 4 3 2 i 9 
1 oO 1 1 1 1 1 0 O 0O O 1 1 OO O O 


(PC) +1 — PC 
(ACC) — (ACCB) - ACC 


The contents of the accumulator buffer (ACCB) are subtracted from the con- 
tents of the accumulator. The result is stored in the accumulator, and the accu- 
mulator buffer is not affected. The carry bit is reset to zero if the result of the 
subtraction generates a borrow. 


, 
[labe| SBB 


nae! cen Libel 5 SeamRameRE 


SBB 


Before Instruction After Instruction 


ACC acc = [41] _[___10000000h 
C C 
ACCB 10000000h] ACCB | 10000000h] 
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SBBB Subtract ACCB From Accumulator With Borrow 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 
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[label SBBB 


None 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 $1 iO 
1 O 1 4 4 4 1 O O80 O0 0 141 141 0 0 +41 


(PC) +1 — PC 
(ACC) — (ACCB) — (Logical inversion of C) — ACC 


The contents of the accumulator buffer (ACCB) and the logical inversion of the 
carry bit are subtracted from the accumulator(ACC). The results are stored in 
the accumulator, and the accumulator buffer is not affected. The carry bit is set 
to zero if the result generates a borrow. 


1 


[label] SBBB 


: Cycle Timings for a Single Instruction 


SBBB 
Before Instruction After Instruction 
acc [1] acc [1] 
C C 
ACCB ACCB 
SBBB 
Before Instruction After Instruction 


acc [0] ACC 
Cc C 
ACCB 098010h ACCB [-__098010h| 


Assembly Language Instructions 


Subtract From Auxiliary Register Short Immediate SBRK 


ELEC ER ERS ASO OOOO OOOO OI oC ss 


Syntax [label SBRK #k 
Operands 0<k < 255 
Opcode 

~ 
Execution (PC) +1 — PC 


AR(ARP) — 8-bit positive constant - AR(ARP) 


Description The 8-bit immediate value is subtracted, right-justified, from the currently se- 
lected auxiliary register with the result replacing the auxiliary register contents. 
The subtraction takes place in the ARAU, with the immediate value treated as 
a 8-bit positive integer. 


Words 1 
Cycles label SBRK #k 


Cycle Timings for a Single Instruction 


Cycle oe fora Repeat (RPT) Execution 
Not Repeatable 


Example SBRK #0FFh 
Before Instruction After instruction 
ARP ARP 
AR7 AR7 [| COFFO1h} 


SETC Set Control Bit 


Syntax 


Operands 
Opcode 


Execution 
Description 


Words 
Cycles 


Example 
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[label] SETC control bit 
control bit: STO or ST1 bit (from : {C, CNF, HM, INTM, OVM, SXM, TC, XF}) 


Set overflow mode (OVM) 
15 14 13 12 if 


10 


N 
oO 
ol 
BS 
i é¥ 
NO 
—_ 
Oo 


1 
Set sign extension mode (SXM) 


Er 
° 
= 
tk 
“x 
— | 
© |o 
° 
i 
ro) 
ro) 
° 
ro) 
an 
=s 


15_ 14 #13 #12 #11 '°10 9 8 7 6 5 4 3 1. 0 
1 O 141 14 74 14 4 OF 808 4 0 0 0 14 71 ~°1 

Set hold mode (HM) 

15 14 13 #12 +11 +#«=10 8 7 6 5 4 3 2 1 9 
1 O 141 74 74 4 4+ O OO 41 0 0 141 0 0 41 

Set TC bit 


1514 +#13 :+#»12 #%11:+#«10 «9 8 7 2) | 
1 oO 1 1 1 1 1 0 0 14 0 0 14 0 1 


Set carry (C) 
15 14 13 12 11:10 9 8 7 6 5 4 3 2 i 0 


1 O 1 1 1 1 1 0 0 14 0 0 1 


Set XF pin high 
15 14 13 12 


ooh, 
— 
owsh, 


| 8 5 4 2 | 
1 O 1 1 1 1 1 0 0 14 0 0 1 1 oO 1 
Set CNF bit 
15. 14.1 +«13~«12~«d41—is10— 99 8B COU TLCCUGCUDG LUGS CU 
11 O 4 1 1 1 1 O 0 
Set INTM bit 
15 14 #13 :#12 %11 +#10 9 8 7 
1 O 1 71 = 1 1 1 0 0 #4 +0 


(PC) +1 — PC 
1 — control bit 


The specified control bit is set to 1. Note that LST may also be used to load 
STO and ST1. See subsection 3.6.3 for more information on each control bit. 


1 
[label] SETC control bit 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
Ls AE OC Os <n 


SETC TC ;TC is bit 11 of STI 


Before Instruction After Instruction 
ST STi 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


[label] SFL 


None 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 QO 
1 oO 1 1 1 1 1 0 0 0 0 0 14 0 0 1 


(PC) +1 — PC 

(ACC(31)) ~ C 
(ACC(30-0)) — ACC(31-1) 
0 — ACC(0) 

Affects C. 

Not affected by SXM bit. 


The SFL instruction shifts the entire accumulator left one bit. The least signifi- 
cant bit is filled with a zero, and the most significant bit is shifted into the carry 
bit (C). Note that SFL, unlike SFR, is unaffected by SXM. 


1 
[label SFL 


Cycle Timings for a Single Instruction 


Cycle Timings for a Repeat (RPT) Execution 
a ee ee 0 ee 


SFL 


Before Instruction After instruction 


ACC | 0B0001234h acc [1] 60002468h 
C G 
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SFLB Shift ACCB and Accumulator Left 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-208 


[label] SFLB 


None 


15 14 13 #12 11:°#10 9 8 7 6 5 4 3 2 1 0 
1 O 1 1 ~=«1 1 1 0 0 0 O 1 O 1 1 O 


(PC) +1 — PC 

0 - ACCB(0) 

(ACCB(30-0))  ACCB(31-1) 
(ACCB(31)) —» ACC(0) 
(ACC(30-0)) — ACC(31—1) 
(ACC(31) > C 


Affects C. 
Not affected by SXM bit. 


The SFLB instruction shifts the concatenation of the accumulator (ACC) and 
accumulator buffer (ACCB) left by one bit position. The least significant bit of 
the accumulator buffer is filled with a zero, and the most significant bit of the 
accumulator buffer is shifted into the least significant bit of the accumulator. 
The most significant bit of the accumulator is shifted into the carry bit (C). The 
SFLB instruction is unaffected by SXM. 


1 
[label SFLB 


Cycle Timings for a Single Instruction 


oe Cycle Timings for a Repeat (RPT) Execution | 


Before Instruction After Instruction 


ACC 0B0001234h ACC 60002469h 
Cc C 


ACCB 0B0001234h ACCB 60002468h 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 


Shift Accumulator Right SFR 


a nin sania aaa atk anata aa ak alata aha EON el Ratt etait itd 


[label] SFR 


None 


15 14 #13 #12 ~#«11:+#10 9 8 7 6 5 4 3 2 1 9 
1 O 1 1 = 1 1 1 0 0 0 0 0 14 0 14 QO 


(PC) +1 — PC 


If SXM = 0: 
ThenO — ACC(31). 
If SXM = 1 
Then (ACC(31)) — ACC(31). 


(ACC(31-1)) -» ACC(30-0) 
(ACC(0)) + C 


Affects C. 
Affected by SXM bit. 


The SFR instruction shifts the accumulator right one bit. 


If SXM = 1, the instruction produces an arithmetic right shift. The sign bit (MSB) 
is unchanged and is also copied into bit 30. Bit 0 is shifted into the carry bit (C). 


lf SXM = 0, the instruction produces a logic right shift. All of the accumulator 
bits are shifted right by one bit. The least significant bit is shifted into the carry 
bit, and the most significant bit is filled with a zero. 


1 
[label] SFR 


Cycle Timings for a Single Instruction | 


PDA PSA 


Cycle Timings for a a (RPT) Execution 


SFR ;(SXM = 0) 


Before Instruction After Instruction 


ACC 0B0001234h Acc [o] | 5800091Ah 
C Cc 


SFR ;(SXM = 1) 
Before Instruction After instruction 


ACC 0B0001234h acc [o] 0D800091Ah 
C Cc 
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SFRB Shift ACCB and Accumulator Right 


Syntax [label SFRB 
Operands None 
Opcode 


15 14 #13 :+#12 ~+#11:°:10 9 8 7 6 5 4 3 2 1 0 
1 O 1 71 = 1 1 1 0 0 0 0 14 0 1 1 1 


Execution (PC) +1 —- PC 


lf SXM=0: 
ThenO — ACC(31) 
lf SXM=1: 
Then (ACC(31)) — ACC(381) 


(ACC(31—1)) -» ACC(30-0) 
(ACC(0)) -» ACCB (31) 
(ACCB(31-1)) -» ACCB(30-0) 
(ACCB(0)) > C 


Affects C. 
Affected by SXM. 


Description The SFRB instruction shifts the concatenation of the accumulator (ACC) and 
accumulator buffer (ACCB) right by one bit position. The LSB of the ACCB is 
shifted into the carry bit. 


lf SXM=1, the instruction produces an arithmetic right shift. The sign bit (MSB) 
of the accumulator is unchanged and is also copied into bit 30. Bit 0 of the accu- 
mulator buffer is shifted into the carry bit (C). 


lf SXM=0, the instruction produces a logic right shift. All of the accumulator and 
accumulator buffer bits are shifted right by one bit. The least significant bit of 
the accumulator buffer is shifted into the carry bit, and the most significant bit 
of the accumulator becomes zero. 


Words 1 
Cycles [label SFRB 


Cycle Timings for a Single Instruction 


Example 1 SFRB ;(SXM = 0) 
Before Instruction After Instruction 
ACC acc [0] 
C C 
ACCB ACCB 


4-210 Assembly Language Instructions 


SFRB 


ROS ita eae ee ee a eee eee eee eo eee ee eee een enna ene eee eee nae nena ea eens een eee ee eee IN I NII III LIT RAR OOM SISOS EEO ITE IA 


Example 2 SFRB ;(SXM = 1) 
Before instruction After instruction 
ACC acc [0] 
C Cc 
ACCB ACCB 
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Syntax Direct: [label] SMMR dma, #adadr 
Indirect: [label] SMMR {ind}, #addr[, next ARP] 


Operands O = addr < 65535 
O0<dmas 127 
O = next ARP <7 
Opcode 


Direct: | 


13 = 
Fe aieace 1 — ae 4.1.2 
| 16-Bit Constant 


Execution PFC — MCS 

(PC) +2 — PC 

1K — PFC 

While (repeat counter = 0): 
(src, specified by lower 7 bits of dma) — (dst, addressed by PFC) 
(PFC) +1 — PFC 
(repeat counter) - 1 — repeat counter 

MCS — PFC 


Description The memory-mapped register value pointed at by the lower 7 bits of the data 
memory address is stored to the data memory location addressed by the 16-bit 
address, addr. The 9 MSBs of the data memory address of the 
memory-mapped register are set to zero, regardless of the current value of DP 
or the upper 9 bits of AR(ARP). This instruction allows any memory location 
on data page 0 to be stored anywhere in data memory without modifying the 
DP field in status register STO. When using the SMMR instruction with the RPT 
instruction, the destination address, #adadr, is incremented after every memo- 
ry-mapped store operation. 


Words 2 


Cycles Direct: [label] SMMR ama, #addr 
Indirect: [label] SMMR {ina}, #addr [, next ARP 


Cycle Timings for a Single Instruction 


Vestination DARAM 2+ZDcode 


Source MMR? 
Destination SARAM 2 2+2Dcode 
=> = 
Destination Ext 3+dust 3+dust 3+dyct 5+ dust+2Pcode 
somemee [fee 


4-212 Assembly Language Instructions 


| Destination DARAM 3+10sr¢ 3+10sr¢ 3+10sr¢ 4+10sr¢+2Pcode 
Source MMPORT 


Destination SARAM 
el 
Destination Ext 4+10ro+dgst 4+10sr¢+ gst 4+1057¢+gst 6+lOsr¢+dgst+2Pcode 
oom | |e ee 


Destination SARAM 2n 2N+2Dcode 
Destination Ext 3N+Ndgs¢ 3N+Ndys¢ 3N+Ndgst 3N+3+Ndystt+2Pcode 
Destination DARAM 2N+NiOgre 2N+NiOgre 2N+NiOgr¢ 2N+1+NlOsro+2Pcode 
Destination SARAM 2N+NIOs7¢ 2N+NOgrc 2N+Ni0s7¢ 2N+14+NiOs;¢+2Pcode 
sowcemirort | || auavagt | 
Destination Ext 5n—2+ndgstt+ 5n—-2+Ndgs¢t+ 5n—2+ndgs+ 5N+1+Ndgs¢t+NiOgrot 


t If the destination operand and the code are in the same SARAM biock. 
+ Add one more cycle if source is a peripheral memory-mapped register. 
$ Add nmore cycles if source is a peripheral memory-mapped register. 


Example 1 SMMR CBCR,#307h :(DP = 6, CBCR = 1Eh) 
Before Instruction After instruction 
Data Memory Data Memory 
307h 1376h 307h 5555h 
CBCR 5555h CBCR 5555h 
Example 2 SMMR *,#307h,AR6 ;(CBCR = 1Eh) 
Before Instruction After instruction 
ARP [Ld 


ARP [dl 
ARG OFO1Eh ARG OFOIEN 
h 


Data Memory Data Memory 
307h 307h 
CBCR CBCR 
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Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


4-214 


None 


(PC) +1 — PC 
(ACC) — (shifted P register) — ACC 


Affects OV and C; affected by PM and OVM. 
Not affected by SXM. 


The contents of the P register, shifted as defined by the PM status bits, are sub- 
tracted from the contents of the accumulator. The result is stored in the accu- 
mulator. Note that SPAC is not affected by the SXM, and the P register is al- 
ways sign-extended. 


The SPAC instruction is a subset of LTS, MPYS, and SQRS. 
1 
[label SPAC 


ce ae —— } iP | | 


Cycle Timings for a Repeat (RPT) Execution 


SPAC ;(PM = 0). 


Before Instruction After Instruction 
P 10000000h P | 10000000h 


~__60000000h} 


> 
© 
© 
of 


| 70000000h acc [11] 
Cc 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Direct: [label SPH dma 
Indirect: [label SPH {ina} [,next ARF 


0 <dma <= 127 
0 =< next ARP <7 


15 14 13 12 11 10 9 8 7 6 5 4 3 #2 #1 0 
Direct:| 1 0 O0 Oa 1 1 O 1 Data Memory Address 

15 14 13 12 11 #10 9Q 7 6 5 4 3 2 1 +O 
Indirect! 1 0 0 0 1 1 =O za See Subsection 4.1.2 


(PC) +1 — PC 
(P register shifter output (31-16)) —> dma 


Affected by PM. 


— 10 


The high-order bits of the P register, shifted as specified by the PM bits, are 
stored in data memory. Neither the P register nor the accumulator is affected 
by this instruction. High-order bits are sign-extended when the right-shift-by-6 
mode is selected. Low-order bits are taken from the low P register when left 
shifts are selected. 


a 


Direct: [labe] SPH dma 
Indirect: [label] SPH {ina} [,next ARP 


Cycle Timings for a Single Instruction 


n 
Operand SARAM n 


a 
n 
Operand Ex ane zendep 


t If the operand and the code are in the same SARAM block. 


SPH  DAT3 ;(DP = 4, PM = 0). 


Before Instruction After Instruction 
P OFEOT9B44n P 
203h 4567h 203h OFEO7h 
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SPH Store High P Register 


Example 2 SPH *,AR7 ;(PM = 2) 
Before Instruction 
ARP a) 
ARG 
P 
Data Memory 
203h 
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After instruction 


ARP 
AR6 
P 

Data Memory 
203h 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Direct: [label SPL dma 
Indirect: [labe| SPL {ina} [,next ARP 


0 <dmas 127 
O < next ARP <7 


15 14 13 12 11109 8 7 6 5 4 3 2 1 0 
1 0 0 0 1 1 0 Of O| Data Memory Address 


i514 #13 #i2 ti 10 9 8 7 6 5 4 3 2 i O 


Indirect} 1 0 0 0 1 1 =O 0 See Subsection 4.1.2 


(PC) +1 — PC 
(P register shifter output (15—-0)) —> dma 


Affected by PM. 


Direct: 


The low-order bits of the P register, shifted as specified by the PM bits, are 
stored in data memory. Neither the P register nor the accumulator is affected 
by this instruction. High-order bits are taken from the high P register when the 
right-shift-by-6 mode is selected. Low-order bits are zero-filled when left shifts 
are selected. 


1 


Direct: [label] SPL dma 
Indirect: [label] SPL {ina} [,next ARP 


T If the operand and the code are in the same SARAM block. 


SPL DAT5 ;(DP = 1, PM = 2). 


Before Instruction After Instruction 
P OFE079844h P OFE079844h 

Data Memory Data Memory 
205h 205h 08440h 
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SPL _ Store Low P Register 


RE a ae a GPE Sot ae HS I aE Sa Na hs ING SCN aS DEN ES” CN EN oe eae Ea aN I aL ae IH oO ee BB HT HES OH oH an sea aH ONY aa 


Example 2 SPL *,AR3 ;(PM = 0). 

Before Instruction After Instruction 
ARP er) ARP ee 
AR2 AR2 

P p 

Data Memory Data Memory 
205h 205h 
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Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Example 1 


Example 2 


CML OC EO ITO A 


Store Parallel Long Immediate SPLK 


Direct: [abe] SPLK #/k,dma 
Indirect: [label SPLK #Ik, {ind} [,next ARP 


0<dmas 127 
0 < next ARP <7 
Ik: 16-bit constant 


15 14 °+#13 :~#12 ~*+W1:+#10 9 8 7 6 5 4 3 2 #1 =O 


16-Bit Constant 


(PC) +2 — PC 
Ik — dma 


The SPLK instruction allows a full 16-bit pattern to be written into any memory 
location. The parallel logic unit (PLU) supports this bit manipulation indepen- 
dently of the ALU so that the ACC is unaffected. 


2 


Direct: label SPLK #/k,dma 
Indirect: labell SPLK #lk, {ind} [,next ARP 


Cycle Timings for a Single Instruction 
Operand DARAM 


ee he 
ae Cae 
Operand SARAM a 


Operand Ext 
Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


SPLK #7FFFh,DAT3 ;(DP = 6) 


Before instruction After Instruction 
Data Memory Data Memory 
303h OFEO7h 303h 7FFFh 
SPLK #1111h,*+,AR4 
Before Instruction After Instruction 


ARP [_ ARP lene | 
AR4 300h AR4 


Data Memory Data Memory 


300h 300h 
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Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 
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[label SPM constant 


0 < constant <3 


1514/13 12 11:10 9 8 7 6 5 4 3 2 1 O 
1° 0 1 41 4 +4 +1 =+4:°0 0 0 0 0 of Pmt 


t See Section 4.5. 


(PC) +1 —- PC 
Constant — product register shift mode (PM) status bits 


Affects PM. 
Unaffected by SXM. 


The two low-order bits of the instruction word are copied into the PM field of 
status register ST1. The PM status bits control the P register output shifter. 
This shifter has the ability to shift the P register output either one or four bits 
to the left or six bits to the right. The bit combinations and their meanings are 
shown below: 


PM Action 

00 No shift of multiplier output 

01 Output left-shifted 1 place and zero-filled 

10 Output left-shifted 4 places and zero-filled 

11 Output right-shifted 6 places, sign-extended; LSB bits lost. 


The left-shifts allow the product to be justified for fractional arithmetic. The right 
shift by six bits has been incorporated to implement up to 128 multiply-accu- 
mulate processes without the possibility of overflow occurring. PM may also 
be loaded by an LST #1 instruction. 


Cycle Timings for a Single Instruction 


Not Repeatable 


t Note that ADD, ADRK, LACL, MPY, SBRK, SPM, SUB, XC, and RPT are nonreneatable. 


SPM 3 ;Product register shift mode 3 is selected, causing 
;all subsequent transfers from the product register 
sto the ALU to be shifted to the right six places. 


Assembly Language Instructions 


Square and Accumulate Previous Product SQRA 


SEP ORES ERLE oo SE SOTO T OOS OCT TOO O TEU UCU UU UO UN UIE MEU RENEE MIEN SEU IE TOE ITO ama EERIE PIES NT IETS erereemsereeonererererarrmrmremerereremeemererenemrener meh erenes OEORORrReC eee EB RMAC mem ene em DECOR EES earn smeoeneareeereneremerenene mr OCDRSnCmeRencr mere eneremOnenmerteNM ON eat MIN MH HN HS ANH a talent ta aida 


Syntax Direct: [label SQRA dma 
Indirect: [labe] SQRA {ing} [,next ARP 
Operands 0 <dmas 127 
0 < next ARP <7 
Opcode 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 #1 0 
15 14 13 12 #11 #10 9 8 7 6 5 4 3 2 1 =O 
Indirect: [9 1 0 1 0 0 1 O|1]  SeeSubsection4i2 
Execution (PC) +1 — PC 
(ACC) + (shifted P register) — ACC 
(dma) — TREGO 
(dma) x (dma) — P register 
Affects OV and C. 
Affected by PM and OVM. 
Description The contents of the P register, shifted as defined by the PM status bits, are add- 
ed to the accumulator. The addressed data memory value is then loaded into 
TREGO, squared, and stored in the P register. 
Words 1 
Cycles 


| Cycle Timings for a Single Instruction | | 
1 . 1+p 


Operand SARAM 


t If the operand and the code are in the same SARAM block. 
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Example 1 SQRA DAT30 ;(DP = 6, PM= 0). 
Before Instruction 
Data Memory 
31Eh 
TREGO Cah 
P 
ACC 
Cc 
Example 2 SQRA *,AR4 ;(PM = 0). 
Before instruction 
ARP a) 
AR3 CS Eh 
Data Memory 
31Eh 
TREGO fs 
P 
ACC 
C 
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Data Memory 
31Eh 


TREGO 
P 
ACC 


AR3 


Data Memory 
31Eh 


TREGO 


[0] 


C 


After Instruction 


After Instruction 


Assembly Language Instructions 


Square and Subtract Previous Product SQRS 


b's tatate acute atetn otere'e-tateraatatecatateta'y eutelescie's'eletotecern aia's n's°0e piers atetstatainte ates sintb eo 'a'e sipveecpirseiecbin sein ern seret tar nrc bre n"s'ate"e elev eeatotcraierrin ss eratatststarh enh eeaete"s"sia cote canaa ai arataretarsrsracecan etattatarere gtk ceTaesre aceTsTeN CELT Orr staceen ata te nce Nebrenectrne boar eoe ese cara ca rare testa acar sere ote eresnena Pestataeress salaries tacecy a enters cerca elena cere at ers Te srarars cern 87g. STsTSrArasess ct OUT Ceasers oat egen ace er erOe eG Bee “ata ete aecees ee cata onnse pa scat ca uasessaides san ascend toate ote's eso stole feta celatete etsta treet caster tatet tec stetstetr ait ots 


Syntax Direct: [label SQRS dma 

Indirect: [label SQRS {ind} [,next ARP 
Operands 0 < dma <x 127 

0 = next ARP <7 
Opcode 


7 6 5 4 3 2 1 0 
| 0 | Data Memory Address 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 =#=0 


Indirect:| 0 1 0 1 #0 0 1 Sea See Subsection 4.1.2 


15 14 13 12 11 10 9 8 
Direct:| O 1 0 1 0 0 1 1 


Execution PC)+1—- PC 


( 

(ACC) — (shifted P register) - ACC 
(dma) — TREGO 

(dma) x (dma) — P register 


Affects OV and C. 
Affected by PM and OVM. 


Description The contents of the P register, shifted as defined by the PM status bits, are sub- 
tracted from the accumulator. The addressed data memory value is then 
loaded into TREGO, squared, and stored in the P register. 


Words 1 


Cycles Direct: [label SQRS dma 
Indirect: [label] SQRS {ino} [,next ARP 


Cycle Timings for a Single Instruction 
Operand DARAM 


ae ee 

ae hoe 
Operand SARAM 1 1+p 
eee ee 
ana FS 0 se 
Bl A 5A cE 


Operand SARAM 
— 
“Operand Ex 


t If the operand and the code are in the same SARAM block. 
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SQRS = Square and Subtract Previous Product 


Example 1 SQRS DAT9 ;(DP = 6, PM = 0). 

Before Instruction 
Data Memory 
309h 
TREGO 
i 
ACC X 
Cc 
Example 2 SQRS *,AR5 ;(PM = 0) 


Before Instruction 


ARS [09h] 


Data Memory 


309h 
TREGO 
p 
ACC 
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After instruction 


Data Memory 
309h 
TREGO 
P 
acc 81 [____12¢hh) 
Cc 
After instruction 
ARP eee 
AR3 
Data Memory 
309h 
TREGO O8h 
P 
acc [1] 
C 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


Direct: [label] SST #n, dma 
Indirect: [label SST #n, {ind} [,next ARP 


0<dma s 127 
n=0,1 
0 < next ARP <7 


Store Status RegisterO SST#0 


1514 13 te tt 109 Ee EC ae Se — ee SeE 


Direct: 


7 6 5 4 3 2 1 =O 


Indirect:} 1 0 0 0 1 1 #1 «0 See Subsection 4.1.2 


Store Status Register1 SST#1 
15 14 13 12 11 10 9 8 7 6 5 4 3 #2 = 1 0 


Direct:} 1 0 O 0 1 #14 #1 #140) Data Memory Address 


15 14 13 12 #11 =#10 9 


15 14 13 12 11°10 9 8 6 5 4 3 2 +1 #20 
1 


7 
Indirect:| 1 0 0 0 1 #1 1 See Subsection 4.1.2 


(PC) +1 — PC 
(status register STn) — dma 


Status register STn is stored in data memory. In the direct addressing mode, 
status register STn is always stored in page 0, regardless of the value of the 
DP register. The processor automatically forces the page to be 0, and the spe- 
cified location within that page is defined in the instruction. Note that the DP 
register is not physically modified. This allows storage of the DP register in the 
data memory on interrupts, etc., in the direct addressing mode without having 
to change the DP. In the indirect addressing mode, the data memory address 
is obtained from the auxiliary register selected (see the LST instruction for 
more information). In the indirect addressing mode, any page in data memory 
may be accessed. 


Status registers STO and ST1 are defined in subsection 3.6.3, Status and Con- 
trol Registers. 


onh, 


Direct: [label SST #n, dma 
Indirect: [label| SST #n, {ind} [,next ARP 


Cycle Timings for a Single Instruction 


Example 1 


Example 2 
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Operand DARAM 
Operand SARAM 


Operand Ext 


t If the operand and the code are in the same SARAM block. 


SST #0 ,DAT96 


STO 


Data Memory 
60h 


SST #1,*,AR7 


ARP 
ARO 
ST1 


' Data Memory 
300h 


;(DP = 6) 
Before Instruction 


OA408h 


Before Instruction 


STO 


Data Memory 
60h 


ARP 
ARO 
ST1 


Data Memory 
300h 


After Instruction 


__0A408h 
0A408h 


After Instruction 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Direct: 


Indirect: 


Direct: 


Indirect: 


Short: 


Long: 


Direct: [label SUB dma |[,shift7] 

Indirect: [label SUB {ind} [, shift? [,next ARP] 
Short Immediate: [label SUB #k 

Long Immediate: [labej SUB #l/k [,shiftZ 


Subtract from accumulator with shift 
15 14 13 12 11 109 8 7 6 5 4 3 2 1 0 


oO 0 1 #41]  SHFTT [0] Data Memory Address 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


roo 1 +] SHFT? [1] _ SeeSubsectionat2 


Subtract from accumulator with shift of 16 
15 14 13 12 11109 8 7 6 5 4 3 2 1 0 


Oo 1 1 0 O 14 0 1 Data Memory Address 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 =O 


0 1 1 0 O 1 0 1 1 See Subsection 4.1.2 


Subtract from ACC short immediate 
15 14 13 12 11109 8 7YF 6 5 4 3 2 1 0 


1 0 1 1 1 0 1 0 8-Bit Constant 


Subtract from ACC long immediate with shift 
15_ 14 13 12 11 10 9 8 7 6 5 4 3 2 1 QO 


16-Bit Constant 


Direct or Indirect Addressing: 


(PC)+1 —- PC 

(ACC) —[(dma) x 2shiftt) — ACC 
Affects C and OV. 

Affected by SXM and OVM. 


Short Immediate Addressing: 


(PC) +1 — PC 
(ACC) -k — ACC 
Affects C and OV. 
Affected by OVM. 
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SUB Subtract From Accumulator 


Description 


Words 


Cycles 


4-228 


Long Immediate Addressing: 
(PC)+2—-PC _ 
(ACC) — Ik x 2shift2 — ACC 


Affects C and OV. 
Affected by SXM and OVM. 


The contents of the addressed data memory location or a 16-bit constant are 
left-shifted and subtracted from the accumulator if direct, indirect, or long im- 
mediate addressing is used. During shifting, low-order bits are zero-filled. 
High-order bits are sign-extended if SXM = 1 and zero-filled if SXM = 0. The 
result is then stored in the accumulator. 


When short immediate addressing is used, an 8-bit positive constant is sub- 
tracted from the accumulator. In this case, no shift value may be specified, the 
subtraction is unaffected by SXM, and the instruction is not repeatable. 


The carry bit is reset to zero if the result of a subtraction generates a borrow; 
otherwise, it is set to 1. If a 16-bit shift is specified with the subtraction, the in- 
struction may reset the carry bit to 0 only if the result of the subtraction gener- 
ates a borrow; otherwise, C is unaffected. 


1. (Direct, indirect, or short immediate) 
2 (Long immediate) 


Direct: [label SUB dma |{,shift7] 
Indirect: paved. SUB {indt [ shift1 Lb net nel 


Oc OO 
Cpernaparam tt tt 
ii Aa PD 
[Operndee edad [ea 


Cycle Timings for a Repeat (RPT) Execution 


RSA PE 
KL A A CW 


~ Operand SARAM 


Operand Ex 


T If the operand and the code are in the same SARAM block. 
Short Immediate: [label SUB #k 


pee 
ee ee 


Assembly Language Instructions 


Example 1 


Example 2 


Example 3 


Example 4 


Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


Long Immediate: [label SUB #/k [,shifta 


Cycle Timings for a Single Instruction 


el ae (Ec ener 
Cycle Timings for a Repeat (RPT) Execution | 
Not Repeatable 


SUB  DAT80 ;(DP = 8, SXM=0) 


Before Instruction After instruction 
Data Memory Data Memory 
450h FC 450h [th 


ACC acc =) Lian] 
C C 


SUB *-—,1,ARO ;(SXM = 0) 


Before Instruction After Instruction 
ARP ARP 
Data Memory Data Memory 
AR7 AR7 
301h 301h 
ACC ACC 
Cc C 


SUB #8h ;(SXM = 1) 
Before Instruction After Instruction 


ACC acc. (o] OFFFFFFFFh 
C Cc 


SUB #OFFFh,4 ;(SXM = 0) 
Before Instruction After Instruction 


ACC | OFFFFh acc. [i] 
C C 
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Syntax Direct: [label SUBB dma 
Indirect: [label SUBB {ina} [,next ARP 


Operands 0<dmas 127 
0 s next ARP s 7 


Opcode 
15 14 138 12 11 10 9 8 7 6 5 4 3 2 +1 =#«0 


Direct:} 0 1 #1 O 0 1 O Of 0) Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 O 


Indirect} O 1 #1 0 0 1 0 Of 1 | See Subsection 4.1.2 


Execution (PC) +1 — PC 
(ACC) — (dma) — (logical inversion of C) — ACC 


Affects OV and C. 
Affected by OVM. 
Not affected by SXM. 


Description The contents of the addressed data memory location and the logical inversion 
of the carry bit are subtracted from the accumulator with sign extension sup- 
pressed. The carry bit is then affected in the normal manner. 


The SUBB instruction can be used in performing multiple-precision arithmetic. 
Words 1 


Cycles Direct: [label SUBB dma 
Indirect: [label] SUBB {ina} [,next ARP 


Cycle Timings for a Single Instruction 


Operand DARAM 


Pe 
_ oe 
Operand SARAM | 


Cycle Timings for a Repeat (RPT) Execution 


PR PDA PSA PE 


Operand SARAM n 
n+1t 


n+p 
n+p 
Operand Ex [rena nnd 


t If the operand and the code are in the same SARAM biock. 
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Example 1 


Example 2 


SUBB DAT5 ~ ;(DP = 8) 
Before Instruction After Instruction 


Data Memory Data Memory 
405h 405h 


acc [0] acc [o] OFFFFFFFFh 
Cc Cc 


SUBB * 
Before Instruction After instruction 
ARP ee ARP [sd 
AR6 ARG 
301h 301h 
ACC 04h acc [1] 02h 
C C 


In the first example, C is originally zeroed, presumably from the result of a pre- 
vious subtract instruction that performed a borrow. The effective operation per- 
formed was 6 — 6 — (0—) = —1, generating another borrow (resetting carry) in 
the process. In the second example, no borrow was previously generated 
(C=1), and the result from the subtract instruction does not generate a borrow. 
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SUBC Conditional Subtract 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
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Direct: 


Indirect: 


Direct: [labe] SUBC dma 
Indirect: [label] SUBC {ind} [,next ARP 


0<sdmas 127 
0 < next ARP <7 


7 6 5 4 3 2 1 0 
0 0 0 0 1 | 0 | Data Memory Address 


15 14 13 12 +11 #10 +9 7 6 5 4 3 2 1 =O 


0 00 01 0 1 ha See Subsection 4.1.2 


(PC) +1 + PC 
(ACC) — [(dma) x 215] — ALU output 


15 14 13 #12 #11 #10 +9 
0 1 


O1@o | O10 


If ALU output = 0: 
Then (ALU output) x 2 +1 — ACC; 
Else (ACC) x 2 — ACC. 


Affects OV and C. 
Affected by SXM. 
Not affected by SXM, and OVM (no saturation). 


The SUBC instruction performs conditional subtraction, which may be used for 
division. The 16-bit dividend is placed in the low accumulator, and the high ac- 
cumulator is zeroed. The divisor is in data memory. SUBC is executed 16 times 
for 16-bit division. After completion of the last SUBC, the quotient of the divi- 
sion is in the lower-order 16-bit field of the accumulator, and the remainder is 
in the higher-order 16-bits of the accumulator. SUBC assumes that the divisor 
and the dividend are both positive. The divisor is not sign extended. The divi- 
dend, which is in the accumulator, must initially be positive (that is, bit 31 must 
be 0) and must remain positive following the accumulator shift, which occurs 
in the first portion of the SUBC execution. 


If the 16-bit dividend contains fewer than 16 significant bits, the dividend may 
be placed in the accumulator and left-shifted by the number of leading nonsig- 
nificant zeroes. The number of executions of SUBC is reduced from 16 by that 
number. One leading zero is always significant. 


Note that SUBC affects OV but is not affected by OVM, and therefore the accu- 
mulator does not saturate upon positive or negative overflows when executing 
this instruction. The carry bit is affected in the normal manner during this in- 
struction. 


1 


Assembly Language Instructions 


Cycles 


Example 1 


Example 2 


Conditional Subtract | SUBC 


Direct: [label SUBC dma 
Indirect: [label] SUBC {ina} [,next ARP 


Cycle Timings for a Single Instruction 


ma 
[OperenspaRAM ttt 


ot 


POR Psa [PE 
[Sperans DARA tp 


Operand SARAM n n n+p 
n+it 
Operand Ext 


t If the operand and the code are in the same SARAM block. 


SUBC DAT2 ;(DP = 6) 


Before Instruction After Instruction 
Data Memory Data Memory 
302h 302h 
ACC 04h acc [o] 
C C 
RPT #15 
SUBC * 
Before Instruction After Instruction 
ARP ARP ae) 
AR3 AR3 
Data Memory Data Memory 
1000h 1000h 
ACC 41h ACC 
Cc C 
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Syntax Direct: [abe] SUBS dma 
Indirect: [label SUBS {ina} [,next ARF 


Operands 0s dmas 127 
O <= next ARP <7 


Opcode 
15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 0 
Direct} 0 1 #1 #0 O 1 1 #Of 0] Data Memory Address 


15 14 13 12 11 #10 9 8 6 5 4 3 2 1 0 


7 
Indirect} 0 1 1 0 0 1 1 «0 See Subsection 4.1.2 


Execution (PC) +1 — PC 
(ACC) — (dma) — ACC 


Affects OV and C; affected by OVM. 
Not affected by SXM. 


Description The contents of the specified data memory location are subtracted from the 
accumulator with sign extension suppressed. The data is treated as a 16-bit 
unsigned number, regardless of SXM. The accumulator behaves as a signed 
number. SUBS produces the same results as a SUB instruction with SXM = 
0 and a shift count of 0. 


and, 


Words 


Cycles Direct: [label SUBS dma 
Indirect: [/abel SUBS {ind} [,next ARP 


Cycle Timings for a Single Instruction 


[Operansparam 
+p 


[Operands ad dee 


Operand DARAM n 


no Pn [mp 

Operand SARAM n n+p 
eee fe 
Operandet | rend | mand | mend | netepend 


T If the operand and the code are in the same SARAM block. 
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raratatalatDatatecatsty aT etatatat aetna ene e hana gee atten ata as ctaha! ane a aaah gra deena nena eee ncaa sa cancers a gang fea fee see gages ceded ae 


Example 1 SUBS DAT2 ;(DP = 16, SXM = 1). 
Before Instruction 

Data Memory 
802h 
ACC 

C 
Example 2 SUBS * ;(SXM = 1) 

Before Instruction 
ARP ee ee 
ARO 

Data Memory 
310h 


ACC OFFFF105h 
Cc 


Subtract From Accumulator With Si 


ert aa an ayn a taal a aa aaa aH A al Waa HA alas aa aa aaa Aaa ata a ae aa a naa ta aati a aa a a ta aC tn nt ati ana nt nt aw n'a iawn A ne Wt a ar ar a 


PASSA AOI 


After Instruction 


Data Memory 
802h OFO03h 


ACC 
C 


After Instruction 


ARP Sane) 

ARO 
Data Memory 

310h OFOO03h 


acc [1] OFFFO102h 
Cc 


SUBT Subtract From Accumulator With Shift Specified by TREG1 


abit ete itt etait ai taitiiit ett aittab eit el tiibi ntti ete ttit iid ti hii ht ete RL LEED DOSER EEL ELE i Date titi tat tei atte Sats alt eesti leant aia teeta tiie itsaintteaaialat teed ataaalataSi etek e aati daiaid al Sal aiel catalaintat rial ial ahatal heal Siete a alae eee iiodd 


Syntax Direct: [abe] SUBT dma 

Indirect: [label SUBT {ind} [,next ARF 
Operands 0 < dma x 127 

O < next ARP <7 
Opcode 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 OQ 


Direct:|} 0 1 #1 0 0 141 =#141 ~=«1/ 077 Data Memory Address 


15 14 13 #12 +211 ~=«10 EE PE EI nee 
indrect[ 01100111] 1] Seo Sibsecion a2 
Execution (PC) +1 — PC 
(ACC) — [(dma) x 27REG1(3-0)} — (ACC) 
If SXM = 1: 
Then (dma) is sign-extended. 
If SXM = 0: 


Then (dma) is not sign-extended. 
Affects OV and C; affected by SXM and OVM. 


Description The data memory value is left-shifted and subtracted from the accumulator. 
The left-shift is defined by the four LSBs of TREG1, resulting in shift options 
from 0 to 15 bits. The result replaces the accumulator contents. Sign extension 
on the data memory value is controlled by the SXM status bit. 


Software compatibility with the C25 can be maintained by setting the TRM bit 
ofthe PMST status register to zero. This causes any ‘C25 instruction that loads 
TREGO to write to all three TREGs. 


Words | 


Cycles Direct: [label SUBT dma 
Indirect: [label SUBT {ind} [,next ARP 


Cycle Timings for a Single Instruction 
Operand DARAM ae ce 


Operand SARAM 
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Example 1 


Example 2 


oops sesssssssctsosessesiaxssiC RR CS SRS Ronepenenitecatacenccenntcn weleceddcnedenccecdcuelnad ceccancuocaddc ceded intokdbne cacti, 


Subtract From Accumulator With Shift Specified by TREG1 SUBT 


Pe Ponsa 
[Operands BARA a 


Operand DARAM pM 


Operand SARAM n | n+p 
n+1T 
Operand Ext Caend [rend [rend | netepend 


t If the operand and the code are in the same SARAM block. 


SUBT DAT127 ;(DP = 4) 
Before Instruction After Instruction 
Data Memory Data Memory | | 
2FFh 2FFh 
TREG1 TREG1 
ACC acc [1] 
C C 
SUBT * 
Before Instruction After Instruction 
ARP Eases ARP aaa 
ARI ART 
Data Memory Data Memory 
g00h B00h 
TREG1 08h TREG1 
ACC acc [0] 
C C 
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Syntax Direct: [label TBLR dma 
Indirect: [label TBLR {ind} [,next ARF 


Operands 0 <dmas 127 
0 < next ARP <7 


Opcode 
15 14 13 12 11 #210 6 5 4 3 2 1 QO 


9 8 7 
Direct} 1 O 1 0 O 1 1 #Of 0] Data Memory Address 


15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 =O 
1 


Indirect:|} 1 0 1 0 0 1 0 pel See Subsection 4.1.2 


Execution (PC) +1 — PC 
(PFC) — MCS 
(ACC(15—0)) — PFC 


If (repeat counter) = 0: 
Then (pma, addressed by PFC) — dma, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 —- PFC 
(repeat counter) —1 — repeat counter. 
Else (pma, addressed by PFC) — dma, 
Modify AR(ARP) and ARP as specified. 
(MCS) — PFC 


Description The TBLR instruction transfers a word from a location in program memory to 
a data memory location specified by the instruction. The program memory ad- 
dress is defined by the low-order 16 bits of the accumulator. For this operation, 
aread from program memory is performed, followed by a write to data memory. 
When the repeat mode is used, TBLR effectively becomes a single-cycle in- 
struction, and the program counter that contains the ACCL is incremented 
once each cycle. 


Words 1 


Cycles Direct: [label TBLR dma 
Indirect: [/abef TBLR {ind} [,next ARP 


Cycle Timings for a Single Instruction 
Source DARAM/ROM 
Destination DARAM 
Source SARAM 
Destination DARAM 


Source Ext 3+Psre 3+Dsre 3+Dsre 3+Psrc+Pcode 
Destination DARAM 
Source DARAM/ROM 3 3+Pcode 
Destination SARAM 4t 
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Cycle Timings for a Single Instruction (Continued) 


3 3+Pcode 

4t 

3+Dsro 3+ Psre 3+Psrc+Pcode 
4+Dsrc! 


4+Dsrco+dgst 6+Psrc+dast+ Pcode 


Cycle Timings for a Repeat (RPT) Execution 


Source SARAM 
Destination SARAM 


Source Ext 

Destination SARAM 
Source DARAM/ROM | 4+dast_ 
Destination Ext 
Source SARAM 
Destination Ext 


| Source Ext 
| Destination Ext 


3+Dsrc 


4+dust 


4+Dsrco+dgst 


Source DARAM/ROM | n+2 | n+ N+2+Deode 


| Destination DARAM 


2 
Source SARAM n+2 i n+2 
Destination DARAM 


N+2+Dcode 


ne 
2nt 


Source Ext 

Destination DARAM 

Source DARAM/ROM 
| Destination SARAM 
Source SARAM > 
Destination SARAM 


N+2+MDorc N+24NDPsro+Pcode 


n+2 n+2 N+2+Dcode 


n+4t 


N+2+Dcode 
2nt 


n+2 n+2 


2nt 2nt 


| 2n+2s | 
| Source Ext | N+2+NDsyo | +24+NDeyc N+2+NnDsrcet+Pcode 
| Destination SARAM | | N+4+nPsyc! : 
| Source DARAM/ROM | 2n+2+ndgc¢ 2n+2+ndst 2n+4+ndast+Pcode 
Destination Ext 
2n+2+ndysy 2n+4+ndyst+Pcode 


Source SARAM _ 


| Destination Ext 
4n+npsrotNdgse |  4N+NDsrotNdysr 4n+2+npercotNdgstt 
| Pcode 


| Source Ext 
Destination Ext 


2n+2+Ndyst 


t If the destination operand and the code are in the same SARAM block. 
+ if both the source and the destination operands are in the same SARAM block. 
§ If both operands and the code are in the same SARAM block. 
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TBLR Table Read 


Example 1 


Example 2 
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TBLR DAT6 ¥ ;(DP = 4) 
Before instruction 


ACC CL 23h] 
Program Memory 
23h 
Data Memory 
206h 


TBLR *, ART 
Before Instruction 


ARP eT 


ARO 300h 
ACC 
Program Memory 
24h 
Data Memory 
300h 


After Instruction 


ACC [23h] 
Program Memory 
23h 
Data Memory 
206h 
After Instruction 
ARP 
ARO 
ACC 
Program Memory 
24h 307h 
Data Memory 
300h 
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Table Write TBLW 


Syntax Direct: [label TBLW dma 
Indirect: [/abef] TBLW {ina} [,next ARP 
Operands 0<dmas 127 
O <= next ARP <7 
Opcode 
15 14 13 12 11 10 9 8 7 6 § 4 3 2 +1 = #=0 
Direct:[ 1 0 1 0 0 1 14 1 ]0]  DataMemoryAddress 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 +1 =O 
Indirect[ 1 0 1 0 0 1 1 1] 1]  SeeSubsection412 
Execution (PC) +1 — PC 
(PFC) — MCS 


(ACC(15-0)) + PFC 


If (repeat counter) = 0: 
Then (dma, addressed by PFC) — pma, 
Modify AR(ARP) and ARP as specified, 
(PFC) + 1 — PFC 
(repeat counter) —1 —> repeat counter. 
Else (dma, addressed by PFC) — pma, 
Modify AR(ARP) and ARP as specified. 
(MCS) — PFC 


Description The TBLW instruction transfers a word in data memory to program memory. 
The data memory address is specified by the instruction, and the program 
memory address is specified by the lower 16 bits of the accumulator. A read 
from data memory is followed by a write to program memory to complete the 
instruction. When the repeat mode is used, TBLW effectively becomes a sing- 
le-cycle instruction, and the program counter that contains the ACCL is in- 
cremented once each cycle. 


Words 1 


Cycles Direct: (label TBLW dma 
Indirect: [/labef] TBLW {ind} [,next ARP, 


(PROT POA PSA 
Source DARAM 3+Pcode 
Source SARAM 3+Pcode 
oeaiiononnan || 
Source Ext 34+sr¢ 3+dsr¢ 3+dsr¢ 3+dsre+Pcode 
joodinionoaran [off 
Destination SARAM 3 3+Pcode 
erie a 
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TBLW Table Write 


Source SARAM ro 3 3+Peode 
4t 


Source Ext. 3+deyo 3+dey 3+dsr ‘T 834dsrctPcode | 
Destination SARAM | 4+de,¢1 
Source DARAM 4+Dast 4+Dast | 4+Past 5+PasttPcode | : 
Destination Ext | 
Source SARAM | 4+Dast 4+Past 4+Dast 5+PasttPcode 
Destination Ext 


| 4+dgro+Past 4+dsr¢+Past | 5+dsr¢+PasttPcode 


Source Ext 4+dsro+Pdst 
Destination Ext 


| Source DARAM n+2 n+2 n+2 | N+2+Deode 
| Destination DARAM | 


Source SARAM — n+2 n+2 n+2 n+2+Dooue = 
Destination DARAM 


Source Ext n+2+ndero n+2+ndoy N+2+ndeo ~+2+nderetPcode 
Destination DARAM 


Source DARAM n+2 n+2 } n+2 N+2+Dcode 
| Destination SARAM n+3t 
Source SARAM T n+2 n+2 N+2+Dcode 
Destination SARAM 2nt 2nt | 2nt 
2n+18§ 
Source Ext N+24+Ndgr¢o nN+2+Nderco n+2+ndeyco n+2+ndero+Pcode 
Destination SARAM _ | N+3+ndgyot 
‘Source DARAM 2n+2+npast  —«|-s« 2N+ Vase 2n+2+NDPacy 2n+3+NPusttPcode _—i| 
Destination Ext 


Source SARAM 2N+2+NPgst 2N+2+NPgst 2N+2+NPgst 2N+3+NPgstt+Pcode 
Destination Ext 


Source Ext | 4n+ndgro+NPgst 4n+ndgrot+NPgst 4n+ndsre+NPgst 4n+1+Nndoro+NPgsrt+ 
Destination Ext | Pcode 


t If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 
§ If both operands and the code are in the same SARAM block. 
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Table Write TBLW 


Example 1 TBLW 
ACC 


Data Memory 
1905h 


Program Memory 
257h 


Example 2 TBLW * 


ARP 
AR6 
ACC 


Data Memory 
1006h 


Program Memory 
258h 


DAT5 ;(DP 


4340h 
307h 


ACC 


Data Memory 
1905h 


Program Memory 
257h 


ARP 
AR6 
ACC 


Data Memory 
1006h 


Program Memory 
258h 


After Instruction 


as a 


4399 


After instruction 


258 


4340 


4340 
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Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 
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[label] TRAP 


None 


15 14 13 #12 #11 :+#10 9 8 7 6 5 4 3 2 +i 0 
1 O 1 1 1 ~=1 ~=1 «0 0 1 0 1 0 0 0 +1 


(PC) +1 — stack 
22h — PC 


Not affected by INTM; does not affect INTM. 


The TRAP instruction is a software interrupt that transfers program control to 
program memory location 22h and pushes the program counter plus one onto 
the hardware stack. The instruction at location 22h may contain a branch in- 
struction to transfer control to the TRAP routine. Putting the PC + 1 onto the 
stack enables a return instruction to pop the return PC (points to the instruction 
after the TRAP) from the stack. The TRAP instruction is not maskable. 


, 
[label] TRAP 


Cycle Timings for a Single Instruction 


t The 'C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 


TRAP ;Control is passed to program memory location 22h and 
7PC + 1 is pushed onto the stack. 


Assembly Language Instructions 


Syntax 


Operands 


Opcode 


Execution 


Description 


Words 
Cycles 


[label XC k [,cond1] [,condZ [,...] 


k=1or2 

Conditions: ACC=0 EQ 
ACC0 NEQ 
ACC<0 LT 
ACCs0 LEQ 
ACC>0 GT 
ACC20 GEQ 
C=0 NC 
C=1 C 
OvV=0 NOV 
OV=1 OV 
BIO low BIO 
TC=0 NTC 
TC=1 TC 
Unconditional UNC 

“A 12 5 4 3 2 1 0 


t[NiPo 1] wi | wet] —2wer 


t See Section 4.5. 


If (condition(s)) 
Then next k instructions executed 
Else execute NOP'’s for next k instructions 


lf k = 2 and conditions are met, the one two-word instruction or two one-word 
instructions following the XC instruction execute. If k = 1 and conditions are 
met, the one-word instruction following the XC instruction executes. If the con- 
ditions are not met, one or two NOPs are executed. Note that not all combina- 
tions of conditions are meaningful. The XC instruction and two-instruction 
words following the XC are uninterruptible. 


Cycle to for a Repeat (RPT) Execution 
Not Repeatable 
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XC Execute Conditionall 


Example XC 1,LEQ,C 
MAR *+ 
ADD DAT100 


If the accumulator contents are less than or equal to zero and the carry bit is 
set, the ARP is modified prior to the execution of the ADD instruction. 
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Syntax 


Operands 


Opcode 


Execution 


Description 


Words 


Direct: [label XOR dma 
Indirect: [label XOR {ind} [,next ARP 
Long Immediate: [label XOR #lk, [,shiffj 


Osdmas 127 

0 < next ARP <7 
Ik: 16-bit constant 
0 < shift s 16 


15 14 13 12 11 +#=10 =@Q 7 6 5 4 3 2 1 =O 


Indirect:} 0 1 1 0 14 1 =40 ex See Subsection 4.1.2 


8 
0 
15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 #O 
0 
8 


156 _ 14 #13 #12 #%11 +#«10 #Q 


16-Bit Constant 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 
1 O 1 71 ~=1 1 1 0 1 0 0 0 0 0 1 1 


16-Bit Constant | 


XOR with ACC long immediate with shift of 16 
t See Section 4.5. 


Direct or Indirect Addressing: 

(PC) +1 — PC 

(ACC(15—0)) XOR dma — ACC(15—-0) 
(ACC(31-—16)) —> ACC(31-—16) 


Long Immediate Addressing: 
(PC) +2 — PC . 
(ACC(31-0)) XOR (Ik x 2shift) — ACC(31-0) 


With direct or indirect addressing, the low half of the accumulator is XORed 
with the contents of the addressed data memory location; the upper half of the 
accumulator is unaffected. With immediate addressing, the long immediate 
constant is shifted and zero-extended on both ends and XORed with the high- 
and low-order bits of the accumulator. The carry bit (C) is unaffected by XOR. 


oud, 


(Direct or indirect addressing) 


NO 


(Long immediate addressing) 
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XOR__ Exclusive-OR With Accumulator 


Cycles 


Example 1 


Example 2 


Example 3 
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Direct: [label] XOR dma 
Indirect: [label XOR {ind} [,next ARF 


aE 
OperansDARAM Tt} ttf 


2 


Cycle Timings for a Repeat (RPT) Execution 


t If the operand and the code are in the same SARAM block. 
Long Immediate: [label XOR #lk, [,shiffj 


XOR DAT127 ;(DP = 511) 
Before Instruction After Instruction 


Data Memory Data Memory 


OFFFFh OFOFOh OFFFFh OFOFOh 
ACC 12345678h ACC 1234A688h 
Cc Cc 


XOR *+,AR0 


Before Instruction After Instruction 
ARP ARP eee) 
AR7 300h AR7 
Data Memory Data Memory 
300h OFFFFh 300h OFFFFh 
of ee | en | aaa ETT TITEL | a eacoas a en | | See TTT | 
ACC [xX] | 1234FOF On] ACC [xX} | 12340F0Fh| 
C C 
XOR #0FOFOh,4 
Before Instruction After Instruction 
ACC 11111010h ACC 111E1F10h 
C C 
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Exclusive-OR of ACCB With Accumulator KORB 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


[label XORB 


None 


1514: =#13:~#«12~«11 #10 9 8 7 6 5 4 3 2 1 JO 
1 O 1 1 1 1 1 0 0 0 0 1 1 O 71 OQ 


(PC) +1 — PC 
(ACC) XOR (ACCB) — ACC 


The contents of the accumulator buffer (ACCB) are exclusive-ORed with the 
contents of the accumulator. The results are placed in the accumulator, and 
the accumulator buffer is unaffected. 


1 
[label] XORB 


Cycle Timings for a Single instruction 


XORB 
Before instruction After Instruction 
ACCB ACCB |___ OFOFOFOF Oh! 
ACC ACC 
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XPL Exclusive-OR Data Memory Value 
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Syntax Direct: [label] XPL [#/k,] dma 

Indirect: [label] XPL [#/k,] {ina} [,next ARP| 
Operands 0sdmas 127 

lk: 16-bit constant 

0 <= next ARP <7 
Opcode 


XOR DBMR with data value 
15 14 13 12 11109 8 7 6 5 4 3 2 1 0 


Direct} 0 1 O 1 #1 #O O Of O| Data Memory Address 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 


indirect|] O 1 #O 1 #14 #0 0 Of 1] See Subsection 4.1.2 


XOR long immediate with data value 
15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 #9 


Direct: O _' 9 1 1 1 0 0 0. | Data Memory Address 
16-Bit Constant 


15 14 #13 #12 ~«+111:+#10 9 8 7 6 5 4 3 2 1 0 


inaireek: 0 1 0 1 1 1 0 See Subsection 4.1.2 


16-Bit Constant 


Execution Ik unspecified: 
(PC) +1 — PC 
(dma) XOR (DBMR) — dma 


lk specified: 

(PC) +2 — PC 

(dma) XOR Ik — dma 
Affects TC. 


Description If along immediate constant is specified, it is XORed with the addressed data 
memory value. If it is not specified, the addressed data memory value is 
XORed with the contents of the dynamic bit manipulation register (DBMR). In 
either case, the result is written back into the specified data memory location, 
and the accumulator contents are not disturbed. If the result of the XOR opera- 
tion is 0, then the TC bit is set to 1. Otherwise, the TC bit is set to 0. 


Words 1. (Long immediate value not specified) 
2 (Long immediate value specified) 


Cvcles Direct: [label XPL [#/k,] dma 
Indirect: [label XPL [#/k,] {ina} [,next ARP 
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Exclusive-OR Data Memory Value XPL 
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Cycle Timings for a Single Instruction 


Operand SARAM 


Operand Ext 4n-2+2nd | 4n-2+2nd | 4n—2+2nd 4n+1+2nd+p 


t If the operand and the code are in the same SARAM block. 


Direct: [label] XPL [#/k,] dma 
Indirect: [label XPL [#I/k,] {ind} [,next ARP 


[POR | PSA 
i 
i . 


Operand Ext 4n-1+2nd | 4n-1+2nd | 4n—1+2nd | 4n+2+2nd+2p 


t If the operand and the code reside in same SARAM block. 


Example 1 XPL #100h,DAT60 ;(DP = 0) 
Before Instruction After instruction 
Data Memory Data Memory 
60h 60h 
Example 2 XPL DAT60 ;(DP=0) 
Before Instruction After Instruction 
DBMR DBMR 
Data Memory Data Memory 
60h 60h 
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XPL __Exclusive-OR Data Memory Value 


Example 3 XPL 


ARP 
ARO 


Data Memory 
300h 


Example 4 XPL *—, ARO 


ARP 
AR6 
DBMR 


Data Memory 
301h 
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#1000h, *,AR6 


Before Instruction 


| 


OFFOOh 


Before Instruction 


| 


| OFFOOh| 


| OEFOOh 


After instruction 


ARP 
ARO 
Data Memory | 
300h 
After Instruction 
ARP a) 
ARG 
DBMR 
Data Memory 
301h 


Assembly Language Instructions 


Syntax Direct: (label ZALR dma 
Indirect: [label ZALR {ind} [,next ARP 


Operands 0<dmas 127 
O < next ARP <7 


Opcode 
15 14 13 12 11 10 Q 7 6 5 4 3 2 +1 ~=«0 


8 
Direct:} O 1 1 O 141 0 0 0 Data Memory Address 


15 14 13 #12 #11 #10 9 8 7 6 5 4 3 2 i 9 


Indirect} 0 1 #1 0 1 0 0 Of 1) See Subsection 4.1.2 


Execution (PC) +1 — PC 
8000h — ACC(15—0) 
(dma) — ACC(31-—16) 


Description To load a data memory value into the high-order half of the accumulator, the 
ZALR instruction rounds the value by adding 1/2 LSB; that is, the 15 low bits 
(bits 0-14) of the accumulator are set to zero, and bit 15 of the accumulator 


is set to one. 
Words 1 
Cycles Direct: [label ZALR dma 
Indirect: [label ZALR {ind} [,next ARP 
POpernsoaRAM TT 
ot 
Cycle Timings for a Repeat (RPT) Execution 
rae 6 ae ES Ee te 
Operand SARAM n n+p 
CL fiw | 
t If the operand and the code are in the same SARAM block. 
Example 1 ZALR DAT3 ;(DP = 32) 
Before Instruction After Instruction 
Data Memory Data Memory 


1003h SFOth 1003h SFOth 


ACC _77FFFFh ACC 3F018000h 
Cc Cc 
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After Instruction 


OFEFFh 


ACC OEOE08000h 
Cc 


Assembly Language Instructions 


Syntax 


Operands 
Opcode 


Execution 


Description 


Words 
Cycles 


Example 


[label ZAP 
None 


15 14 13 12 11 #10 9 8 7 6 5 4 3 2 1 0O 
1 O 1 1 1 1 1 0 0 4 0 1 1 O O 1 


(PC) +1 — PC 
0 — ACC 
0 — PREG 


The accumulator and product register are zeroed. The ZAP instruction speeds 
up the preparation for a repeat multiply/accumulate. 


awk, 


Cycle Timings for a Repeat (RPT) Execution 
a ae eee cee 


ZAP 
Before Instruction After Instruction 
PREG PREG 
ACC ACC 
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ZPR_ Zero Product Register 
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Syntax [label ZPR 
Operands None 
Opcode 


15 14 13 12 11°10 9 8 7 6 5 4 3 2 i O 
1 O 14 71 = 1 1 #1 0 0 1 0 1 #1 #0 0 0 


Execution (PC) +1 — PC 
0 — PREG 
Description The product register is set to zero. 
Words 1 
Cycles [label ZPR 


7 Cycle Timings for a Single Instruction _ | | 


| Cycle Timings for a iniaaa (RPT) Execution 


Example ZPR 
Before instruction After Instruction 
PREG 3F0111 1 th PREG __00000000h 
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'C2x-to-’C5x Instruction Set Mapping 


4.4 ’C2x-to-’C5x Instruction Set Mapping 


Table 4—5 provides a map between the ’C2x and ’C5x instruction sets. The Tex- 
as Instruments ’C5x assembler accepts instruction mnemonics from either in- 
struction set. Because the ’C5x instruction set is a superset of the ’C2x instruc- 
tion set, there are some ’C5x instructions that do not appear in the table. 


Table 4—6. Mapping Summary 


|__ Accumulator Memory Reference Instructions | 
|__“C2xMnemonio | CSxMnemonic 
a 
ADDC ADDC 
ADDH — ADD 
es... Aa a 
ADDT ADDT 
ADLK _ ADD 
N 
ANDK 
CMPL 


Mi 
1, 
O 


> 
za 
0 


CMPL ~ 
LACC 


5 
e) 


| > 
oO 
> 
za 
O 


LACK LACL 
LALK LACC 
NEG NEG 


NORM NORMt 


alo 

i 
a 
D 
it 
© 


t There is a potential pipeline conflict with the NORM instruc- 
tion. See the NORM instruction summary for details. 
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Table 4—6. Mapping Summary (Continued) 
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Accumulator Memory Reference Instructions 
(Concluded) 


’C2x Mnemonic 
SUBC 


SUBH 


UB 
SUBS 
raCL 
AC 
ZALR LR 
ZALS LACL 


SUBC 


S 
S 
ZA 


Auxiliary Registers and Data Page Pointer 
Instructions 


Assembly Language Instructions 


’C2x-to- 'C5x Instruction Set Mapping 


Table 4—6. Mapping Summary (Continued) 


T Register, P Register, and Multiply Instructions 
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’C2x-to-’C5x Instruction Set Mapping 


Table 4—6. Mapping Summary (Continued) 
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Branch/Call Instructions (Concluded) | 
’C2x Mnemonic 


; BIOZ BCND 
BLEZ BCND 


BCND | 


ae 

ae 
a 
a 
RET 


a 
*C2x Mnemonic | '€5xMnemonic 
a 


| BLKP BLPD 


t The suggested mapping requires that the data page pointer 
be set to 0. 


Assembly Language Instructions 


‘C2x-to-'C5x Instruction Set Mapping 


Table 4—6. Mapping Summary (Concluded) 


Control Instructions 


4-261 


Instruction Set Opcode 


4.5 Instruction Set Opcode 


This section summarizes the opcodes of the instruction set for the 'C5x digital 
signal processors. This instruction set is a superset of the 'C1x and ’C2x in- 
struction sets. The instructions are arranged according to function and are al- 
phabetized within each category. 


The following symbols are used in the opcode table: 


Symbol Meaning 
Data memory address bit. 
Three-bit field containing the auxiliary register value (0 — 7). 


BIT X 


Four-bit field specifies which bit to test for the BIT instruction. 
See CMPR instruction. | 


Addressing mode bit. O= direct addressing mode 
1 = indirect addressing mode 


Short Immediate value. 


Interrupt vector number. 


O >| > 
= Be 
x< 


INTR# 


Constant copied into PM bits in status register ST1. See SPM instruction. 
Three-bit shift value. 


S HF T Four-bit shift value. 


Field for the XC instruction indicating the number of instructions (one or two) to condi- 
tionally execute. N=1 One instruction to execute. 


” 
a 
nn 


N=2 Two instruction to execute. 


+ 
-y 


Two bits used by the conditional execution instructions to represent the conditions TC, 
NTC, and BIO. 

TP Meaning 

00 BIO low 

01 TC=1 

10 TC=0 

11 None of the above condition. 


ZLVC Four-bit field representing the following conditions: 


Zz: ACC=0 
L: ACC<0O 
V: Overflow 
C: Carry 


A conditional instruction contains two of these four-bit fields. The four-LSB field of the | 
instruction is a four-bit mask field. A one in the corresponding mask bit indicates that 
condition is being tested. The second four-bit field (bits 4 — 7) indicates the state of the 
conditions designated by the mask bits as being tested. For example, to test for ACC. 
z 0, the Z and L fields are set, while the V and C fields are not set. The next four-bit 
field contains the state of the conditions to test. The Z field is set to indicate to test the 
condition ACC = 0, and the L field is reset to indicate to test the condition ACC 20. The 
conditions possible with these 8 bits are shown in the BCND, CC, and XC instructions. 
To determine if the conditions are met, the four LSB bit mask is ANDed with the condi- 
tions. If any bits are set, the conditions are met. 


+ 1 word Indicates the instruction is a two-word instruction. The second word is a 16-bit long 


immediate value or a 16-bit program memory address for immediate addressing. 
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Table 4-7. Opcode Summary 


Instruction Set 


Accumulator Memory Reference Instructions 
[instruction [Mnemonic [ Opcode 


Absolute value of accumulator 

Add ACCB to accumulator with carry 

Add to accumulator with shift 

Add to low ACC short immediate 

Add to ACC long immediate with shift 

Add to accumulator with shift of 16 

Add to accumulator with carry 

Add ACCB to accumulator 

Add to low accumulator with sign suppressed 
Add to ACC with shift specified by TREG1 
AND accumulator with data value 

AND with ACC long immediate with shift 
AND with ACC long immediate with shift of 16 
AND ACCB with accumulator 

Barrel shift accumulator right 
Complement accumulator 

Store ACC in ACCB if ACC > ACCB 
Store ACC in ACCB if ACC< ACCB 
Exchange ACCB with accumulator 

Load accumulator with ACCB 

Load accumulator with shift 

Load ACC long immediate with shift 

Load ACC with shift of 16 

Load low word of ACC with immediate 
Load low word of accumulator 

Load ACC with shift specified by TREG1 
Load ACCL with memory-mapped register 
Negate accumulator 

Normalize accumulator 

OR accumulator with data value 

OR with ACC long immediate with shift 
OR with ACC long immediate with shift of 16 
OR ACCB with accumulator 

Rotate accumulator 1 bit left 

Rotate ACCB and accumulator left 

Rotate accumulator 1 bit right 

Rotate ACCB and accumulator right 

Store accumulator in ACCB 

Store high accumulator with shift 

Store low accumulator with shift 

Store ACCL to memory-mapped register 
Shift ACC 16 specified by TREG1 [4] 

Shift ACC 0-15 specified by TREG1 [3,0] 
Subtract ACCB from accumulator 
Subtract ACCB from accumulator with carry 
Shift accumulator 1 bit left 

Shift ACCB and accumulator left 

Shift accumulator 1 bit right 

Shift ACCB and accumulator right 
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Table 4—7. Opcode Summary (Continued) 


Accumulator Memory Reference Instructions (Concluded) 


[stuction SS ~S*d:CMnomnlo | Oped 


Subtract from accumulator with shift 

Subtract from accumulator with shift of 16 
Subtract from ACC short immediate 
Subtract from ACC long immediate with shift 
Subtract from accumulator with borrow 
Conditional subtract 

Subtract from ACC with sign suppressed 
Subtract from ACC, shift specified by TREG1 
XOR accumulator with data value 

XOR with ACC long immediate with shift 
XOR with ACC long immediate with shift of 16 
XOR ACCB with accumulator 

Zero ACC, load high ACC with rounding 
Zero accumulator and product register 


+ 1 word 


+ 1 word 
+ 1 word 


Auxillary Registers and Data Page Pointer Instructions 


Add to AR short immediate 

Compare AR with CMPR 

Load AR from addressed data 

Load AR short immediate 

Load AR long immediate 

Load data page pointer with addressed data 
Load data page immediate 

Modify auxiliary register 

Store AR to addressed data 

Subtract from AR short immediate 


Parallel Logic Unit Instructions 


Teatustion «dz nome Oca 


AND DBMR with data value 

AND long immediate with data value 
Compare DBMR to data value 
Compare data with long immediate 
OR DBMR to data value 

OR long immediate with data value 
Store long immediate to data 

XOR DBMR to data value 

XOR long immediate with data value 


+ 1 word 
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Instruction Set Opcode 


Table 4—7. Opcode Summary (Continued) 


T Register, P Register, and Multiply Instructions 


inetricion———*d;CMnomnlo [Opodo 


Add product to accumulator 010 

Load high product register 

Load TREGO 

Load TREGO and accumulate previous product 

Load TREGO, accumulate previous product, and 
move data 

Load TREGO and load ACC with PREG 

Load TREGO and subtract previous product 

Multiply/accumulate 

Multiply/accumulate with data shift 

Mult/ACC with source ADRS in BMAR and DMOV 

Muit/ACC with source address in BMAR 

Multiply data value times TREGO 

Multiply TREGO by 13-bit immediate 

Multiply TREGO by long immediate 

Multiply TREGO by data, add previous product 

Multiply TREGO by data, ACC — PREG 

Multiply unsigned data value times TREGO 

Load accumulator with product register 

Subtract product from accumulator 

Store high product register 

Store low product register 

Set PREG shift count 

Data to TREGO, square it, add PREG to ACC 

Data to TREGO, square it, ACC — PREG 

Zero product register 


So 


+ 1 word 
+ 1 word 


BEEEEEE BEES 


+ 1 word 


BESEEEE 


2: 


ro 
© 
© 
© 
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Instruction Set Opcode 


Table 4—7. Opcode Summary (Continued) 


Branch Instructions 


Branch unconditional with AR update 
Branch unconditional with AR update delayed 
Branch addressed by ACC 

Branch addressed by ACC delayed 

Branch AR = 0 with AR update 

Branch AR = 0 with AR update delayed 
Branch conditional 

Branch conditional delayed 

Call subroutine addressed by ACC 

Call subroutine addressed by ACC delayed 
Call unconditional with AR update 

Call unconditional with AR update delayed 
Call conditional 

Call conditional delayed 

Software interrupt 

Nonmaskable interrupt 

Return 

Return conditional 

Return conditionally, delayed 


Return, delayed 

Return from interrupt with enable 

Return from interrupt 

Trap 

Execute next one or two INST on condition 


/O and Data Memory Operations 


a 


Block move from data to data memory 

Block move data to data DEST long immediate 
Block move data to data with source in BMAR 
Block move data to data with DEST in BMAR 
Block move data to PROG with DEST in BMAR 
Block move from program to data memory 
Block move Prog to data with source in BMAR 
Data move in data memory 

Input external access 

Load memory mapped register 

Out external access 

Store memory mapped register 

Table read 

Table write 
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+ 1 word 
+ 1 word 
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Instruction Set Opcode 


Table 4-7. Opcode Summary (Concluded) 


Control Instructions 
[instruction Mnemonic [Oped 


Test bit specified immediate BITX IAAA AAAA 
Test bit in data value as specified by TREG2 1111 IAAA AAAA 
Reset overflow mode 1110 0100 0010 
Reset sign extension mode 1110 0100 0110 
Reset hold mode 1110 0100 1000 
Reset TC bit 1110 0100 1010 
Reset carry | 1110 0100 1110 
Reset CNF bit 1110 0100 0100 
Reset INTM bit 1110 0100 0000 
Reset XF pin 1110 0100 1100 
Idle 1110 0010 0010 
Load status register 0 1110 IAAA AAAA 
Load status register 1 1111 IAAA AAAA 
No operation 1011 0000 0000 
Pop PC stack to low accumulator 1110 0011 0010 
Pop stack to data memory 1010 IAAA AAAA 
Push data memory value onto PC stack 0110 IAAA AAAA 
Push low accumulator to PC stack 1110 0011 1100 
Repeat instruction as specified by data 1011 IAAA AAAA 
Repeat next INST specified by long immediate 1110 1100 0100 
Repeat INST specified by short immediate | 1011 IIII IIII 
Block repeat 1110 1100 0110 
Ciear ACC/PREG and repeat next INST long | 1110 1100 0101 
immediate 7 
Set overflow mode 1110 0100 0011 
Set sign extension mode : 1110 0100 0111 
Set hold mode 1110 0100 1001 
Set TC bit 1110 0100 1011 
Set carry 1110 0100 1111 
Set XF pin high 1110 0100 1101 
Set CNF bit 1110 0100 0101 
Set INTM bit 1110 0100 0001 
Store status register 0 1110 IAAA AAAA 
Store status register 1 1111 IAAA AAAA 
Idle until interrupt — low power mode 1110 0010 0011 


+ 1 word 


+ 1 word 
+ 1 word 
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The seven peripheral interfaces connected to the ’C50, ’C51 and ’C53 core 
CPU are the serial port, TDM serial port, timer, software-programmable wait- 
state generators, I/O ports, divide-by-one clock, and XF and BIO pins. These 
peripherals are controlled through registers that reside in the memory map. 
The serial ports and timer are synchronized to the core CPU via interrupts. Pe- 
ripherals and peripheral control are discussed in this chapter as shown below. 


Topic Page 
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5.1 Peripheral Control 


Peripheral circuits are operated and controlled through access of memory- 
mapped control and data registers. The operation of the serial ports and timer 
is synchronized to the processor via interrupts or through interrupt polling. Set- 
ting and clearing bits can enable, disable, initialize, and dynamically reconfi- 
gure the peripherals. Data is transferred to and from the peripherals through 
memory-mapped data registers. When a peripheral is not in use, the internal 
clocks are shut off from that peripheral, allowing for lower power consumption 
when the device is in normal run mode or idle mode. 


5.1.1. Memory-Mapped Registers and I/O Ports 


Twenty-eight core processor registers are mapped into the data memory 
space, they are listed in subsection 3.4.1. In addition to these core registers, 
15 peripheral registers and 16 I/O ports are mapped into the data memory 
space. Table 5—1 lists the memory-mapped registers and I/O ports of the ’C5x. 
Note that all writes to memory-mapped peripheral registers require one addi- 
tional machine cycle. 


Table 5—1. Memory-Mapped Registers and I/O Ports 
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Table 5—1. Memory-Mapped Registers and I/O Ports (Continued) 


Memory-Mapped Core Processor Registers (Concluded) 


Address Description 


a 
"7 eras 
ro __[edorReger 
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ae 
a 
[as 
ae 
ar 


CBSR2 
CBER2 


ok —_ 
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DXR 33 Data Transmit Register 
SPC 34 Serial Port Control Register 


[Tow TeranitDataReaiter =| 


TIM 


io) 
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CR 


PDWSR 
lOWSR 
CWSR 
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TRCV 
TDXR 
TSPC 
TCSR ~ 
TRTA 
TRAD 
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interrupts 


ae oT 
Memory-Mapped I/O Portst 

a 
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t See Section 6.2 for memory-mapped I/O ports. 


The ’C5x devices have four external, maskable user interrupts (INT4—INT1) 
that external devices can use to interrupt the processor; there is one external 
nonmaskable interrupt (NMI). Internal interrupts are generated by the serial 
port (RINT and XINT), the timer (TINT), the TDM port (TRNT and TXNT), and 
the software interrupt instructions (TRAP, NMI, and INTR). Interrupt priorities 
are set so that reset (RS) has the highest priority and INT4 has the lowest prior- 
ity. The NMI has the second highest priority. 


This subsection explains interrupt organization and management. Vector-rela- 
tive locations and priorities for all internal and external interrupts are shown in 
Table 5—2. No priority is set for the TRAP instruction (used for software inter- 
rupts), but itis included here because it has its own vector location. Each inter- 
rupt address has been spaced apart by two locations so that branch instruc- 
tions can be accommodated in those locations. 


The interrupt vectors reside at locations determined by the five-bit IPTR field 
of the PMST and the address values shown in Table 5—2. The IPTR field is set 


Peripherals 


to zero upon device reset, resulting in the interrupt vectors mapping to O000h 
in the program memory space. The vectors’ program address can be re- 
mapped to the beginning of any of the 32 2K-word blocks composing program 
memory space. This is done by loading a five-bit block address (5 MSBs of a 
full 16-bit address) into the IPTR. For example, the vectors can be moved to 
the beginning of the on-chip program RAM of the ’C50 by loading IPTR with 
1. When an interrupt trap occurs, the value in the IPTR is loaded into the most 
significant five bits of the vector address, and the relative address of the inter- 
rupt causing the trap constitutes the 6 LSBs of the vector address. This relative 
addressing scheme applies to all interrupts as well as to the software trap. It 
does not apply to the reset vector, because the reset signal forces the IPTR 
to be set to zero. 


Table 5—2. Interrupt Locations and Priorities 
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When an interrupt occurs, it is stored in the 16-bit interrupt flag register (IFR). 
Note that this happens regardless of whether that interruptis currently enabled 
or disabled. Each interrupt sets a flag in IFR. The flag can be cleared in any 
of the following three ways: 


1) Device reset (RS active low), 
2) The program takes the interrupt trap, or 
3) The program writes a one to the appropriate bit in the IFR. 
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The IFR is located at address 6 in the data memory space and can be read to 
identify active interrupts and written to clear interrupts. The IFR register is laid 
out as follows: 


15 9 8 7 6 5 4 8 2 1 0 
| Reserved =| INT4TXNT TRNT XINT RINT TINT INTS INT2 INT? 


Note that the ’C5x uses only ten of the sixteen generic interrupt lines to the core 
CPU shown in Section 3.8. 


A one in a specific bit, when read, indicates an active interrupt. For example, 
if the IFR is read to be 0005h, then INT3 and INTT are active. A one can be 
written to a specific bit to clear the corresponding interrupt. In the example, if 
a one is written to bit zero (0001h to IFR), then the INTT7 interrupt would be 
cleared. In the above example, the value 0005h could be written back into the 
IFR to clear both pending interrupts. 


A corresponding interrupt flag is automatically cleared when the interrupt trap 
is taken. When the CPU accepts the interrupt and fetches the instruction at the 
interrupt vector location, it generates an interrupt acknowledge (TACK) signal 
that clears the appropriate interrupt flag bit. A hardware reset (RS active low) 
clears all pending interrupt flags. 


The ’C5x devices have a memory-mapped interrupt mask register (IMR) for 
masking external and internal interrupts. The layout of the register is as fol- 
lows: 


15 9 8 7 6 5 4 8 2 14 0 
| Reserved = [INT4TXNT TRNT XINT RINT TINT INTS INT2 INT 


A 1 in bit positions 8 through 0 of the IMR enables the corresponding interrupt, 
provided that INTM = 0. The IMR is accessible with both read and write opera- 
tions. Note that RS and NMI are not included in the IMR; the IMR has no effect 
on reset or a nonmaskable interrupt. 


Interrupts may be asynchronously triggered. In the functional logic organiza- 
tion for INT4—INTT, shown in Figure 5—1, the external interrupt INTn is syn- 
chronized to the core via a five flip-flop synchronizer. The actual implementa- 
tion of the interrupt circuits is similar to this logic implementation. A one is 
loaded into the IFR if a 1-1-0-0-0 sequence on five consecutive CLKOUT1 
cycles is detected. 


The ’C5x devices sample the external interrupt pins multiple times to avoid noi- 
se-generated interrupts. To detect an active interrupt, these devices must 
sample the signal low on at least three consecutive machine cycles. Once an 
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interrupt is detected, the devices must sample the signal high on at least two 
consecutive machine cycles to be able to detect another interrupt. The exter- 
nal interrupt pins are sampled on the rising edge of CLKOUT1. If the external 
interrupts are running asynchronously, the pulses should be stretched to guar- 
antee three consecutive low samples. 


Figure 5—1. External Interrupt Logic Diagram 
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If the INTM bit and mask registers have been properly enabled, the interrupt 
signal is accepted by the processor. An [ACK signal is then generated. The 
TACK clears the appropriate interrupt edge flip-flop and sets the INTM =1. The 
logic is the same for INT1—INT4. NMI uses the same logic, except that it is not 
affected by IMR or INTM status. 


The context of the interrupted code segment is automatically saved by the pro- 
cessor. When the processor takes the interrupt trap ([ACK goes active low), 
the accumulator, accumulator buffer, product register, index register, auxiliary 
compare register, STO, ST1 (except for the XF bit), PMST, and all three tempo- 
rary registers are pushed onto corresponding one-deep stacks. At the comple- 
tion of the ISR, the RETI (return from interrupt) or RETE instruction causes the 
stacks to be popped automatically to restore the interrupted code segment's 
context. Because these stacks are one deep, nesting of interrupts requires a 
software context save. However, the overhead is lowered considerably by the 
automatic context save. Therefore, it is usually more code efficient to serially 
execute multiple ISRs. 


5.1.3 Peripheral Reset 


Interrupt service routines can be invoked in software via the INTR instruction 
(see page 4-76 for details). 


A number of actions occur when the 'C5x is reset. Subsection 3.8.1 describes 
what happens in the ’C5x core when reset is activated. On a device reset, the 
core CPU sends an SRESET signal to the peripheral circuits. The SRESET 
signal has the following consequences in the peripheral circuits: 


1) 


2) 


The two software wait-state registers are set to OFFFFh, causing all exter- 
nal accesses to occur with 7 wait states. The CWSR is loaded with OFh. 
The FO bits of the SPC and TSPC registers are set to zero, selecting a 
word length of 16 bits for each serial port. 

The FSM bits of the SPC and TSPC registers are set to zero. FSM must be 
set to one for operation with frame sync pulses. 

The TXM bits of the SPC and TSPC are set to zero, configuring the FSX 
and TFSX pins as inputs. 

The SPC and TSPC registers are loaded with OyOOh, where the 2 MSBs of 
y are 10 (binary) and the 2 LSBs of y reflect the current levels on the trans- 
mit and receive clock pins of the respective port. 

The TIM and PRD registers are loaded with OFFFFh. The TDDR field of 
the TCR is set to zero. The timer is started. 
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5.2 Parallel Input/Output Ports 


The ’C5x devices have 64K parallel input/output ports. I/O port accesses are 
defined as accesses during which the I/O space select signal (IS) is active. Six- 
teen of the 64K ports are mapped in data memory space as shown in Table 
5-1. All 64K I/O ports can be accessed with the IN and OUT instructions. The 
16 memory-mapped I/O ports (50h—5Fh) can also be accessed via any in- 
struction that reads or writes a location in data space. RD can be used in con- 
junction with chip-select logic to generate an output enable signal for an exter- 
nal peripheral. The WE signal can be used in conjunction with chip-select logic 
to generate a write enable signal for an external peripheral. Figure 5-2 shows 
typical I/O port interface circuitry. Note that the decode section can be simpli- 
fied if fewer I/O ports are used. 


Figure 5-2. I/O Port Interface Circuitry 
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5.3 Software-Programmable Wait-State Generators 


Software-programmable wait-state generators can be used to extend external 
bus cycles by up to 7 machine cycles. This provides a convenient means for 
interfacing external devices that do not satisfy the full-speed access-time re- 
quirements of the ’C5x. Devices requiring more than 7 wait states can be inter- 
faced with the hardware READY line. When all external accesses are confi- 
gured for zero wait states, the internal clocks to the wait-state generator are 
shut off, allowing the device to run in a lower power mode of operation. 


The software-programmable wait-state generators are controlled by two 16-bit 
wait-state registers (PDWSR and IOWSR) and a 5-bit control register 
(CWSR). Each of the three external spaces (program, data, and I/O spaces) 
has an assigned field in a software wait-state register. Wait states for the pro- 
gram and data spaces are specified in the lower and upper halves of PDWSR, 
respectively. Wait states for |/O space are specified in IOWSR. The bits of 
CWSR control the mapping between wait-state register contents and the num- 
ber of wait states. 


The program and data spaces each consist of 64K addresses. Each 64K 
space can be viewed as being composed of four 16K-word blocks. Each 16K 
address segment in program and data space is associated with 2 bits in 
PDWSR, as shown in Table 5—3. The value of a 2-bit field in PD WSR specifies 
the number of wait states to be inserted for each access in the given space and 
address range. 
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Table 5—3. Software Wait-State Registers 
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PDWSR 0- 
2-3 
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eo | at 
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OCO00h—OFFFFh 
OCOO0Oh-OFFFFh 


1 
iil  BiG=o0 IG = 
1 /O Port 0/1, Port 10/11, etc 0000h—1FFFh 
| 23 Port 2/3, Port 12/13, etc 2000h-3FFFh 
1 


Port 4/5, Port 14/15, etc. 4000h—5FFFh 
Port 6/7, Port 16/17, etc. 6000h—7FFFh 


Port 8/9, Port 18/19, etc 8000h—9FFFh 


Port OA/OB, Port 1A/1B OAQ00h—OBFFFh 
etc. 


Port OC/OD, Port 1C/1D, OCO00h-ODFFFh 
ate. 


Port OE/OF, Port 1E/1F, etc. | OEQ00h-OFFFFh 


The I/O space wait-state register (IOWSR) can be mapped in either of two 
ways, as specified by the BIG bit in the CWSR register. If BIG=0, each of 8 pairs 
of memory-mapped I/O ports has its own 2-bit field in IOWSR. Note that even 
when BIG=0, the entire I/O space is configured with wait states on two-word 
boundaries (i.e., port 0/1, port 10/11, and port 20/21 all have the same number 
of wait states). This configuration provides maximum flexibility when I/O bus- 
cycles access peripherals such as D/A and A/D devices. However, if 1/O ac- 
cesses read and/or write devices that are addressable (e.g., external RAM), 
BiG can be set to 1. In this case, the 64K I/O space is divided into eight 8K-word 
address blocks, with each block having an independently programmable num- 
ber of wait states. 


10—11 
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Note that the wait-state generators affect external accesses only; internal ac- 
cesses always have zero wait states. 


The four bits in CWSR allow the user to select one of two mappings between 
2-bit wait-state fields and the number of wait states for the corresponding 
space. As shown in Table 5—4, if a particular bit of CWSR is a zero, the map- 
ping between wait-state field values and the resulting number of wait states 
is direct: the number of wait states for external accesses in the space asso- 
ciated with that control bit is equal to the wait-state field value. If the control bit 


of CWSR is a one, the number of wait states is determined by the mapping 
shown in Table 5—4. Table 5—5 shows the layout of the CWSR register in 
PDWSR and IOWSR registers. You should always program the CWSR regis- 
ter prior to configuring the PDWSR and IOWSR registers to avoid configuring 
memory with too few wait states during the set-up of wait-state registers. 


Table 5—4. Wait-State Field Values and Wait States as a Function of CWSR Bit n 


Wait-State Fieldt No. of Wait States No. of Wait States 
of PDWSR or IOWSR (CWSR Bit n = 0) (CWSR Bit n = 1) 
(Binary Value) 


t This bit field corresponds to the bit field defined in the second column of Table 5-3. 


Table 5—5. Space Controlled by CWSR Bit n 
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Figure 5-3 shows a block diagram of the wait-state generator logic for external 
program space. When an external program access is decoded, the appropri- 
ate field of the PDWSR wait-state register is loaded into the counter. If the field 
is not 000, a not-ready signal is sent to the CPU. The not-ready condition is 
maintained until the counter decrements to zero and the external READY line 
is high. The external READY and the wait-state register READY are ORed to- 
gether to generate the CPU WAIT signal. Also, the READY line is sampled at 
the falling edge of CLKOUT. (Note that the external READY line is machine- 
sampled only at the last cycle of an external access if the on-chip wait-state 
generator is used to insert software wait states). 


Upon reset, all the software wait-state control register fields are set to 7. 
CWSR is set to OFh. Device reset also sets the BIG bit of the CWSR register 
to zero. 
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5.4 General-Purpose I/O Pins 


The ’C5x devices have two general-purpose pins that are software controlled. 
The BIO pin is a branch control input pin, and the XF pin is an external flag out- 
put pin. For detailed timing specifications of BIO and XF signals, refer to Ap- 
pendix A. 


The BIO pin monitos peripheral device status—especially as an alternative to 
an interrupt when time-critical loops must not be disturbed. A branch can be 
conditionally executed when the BIO input is active (low). The timing diagram, 
shown in Figure 5-4, is an example of the BIO operation. This timing diagram 
is for a sequence of single-cycle, signal-word instructions located in external 
memory. The BIO condition is sampled during the decode phase of the pipeline 
for the XC instruction. All other instructions sample the BIO pin during the ex- 
ecute phase of the pipeline. 


Figure 5-4. BIO Timing Diagram 
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The XF (external flag) pin signals to external devices via software. Itis set high 
by the SETC XF (set external flag) instruction and reset to a low level by the 
CLRC XF (reset external flag) instruction. XF is set high upon device reset. The 
relationship between the time SETC/CLRC instruction is fetched, and the time 
the XF pinis set or reset as shown in Figure 5—5. As with BIO, the timing shown 
for XF is for a sequence of single-cycle, single-word instructions located in ex- 
ternal memory. Actual timing may vary with different instruction sequences. 


Figure 5—5. External Flag Timing Diagram 
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5.5 Serial Port 


A full duplex (bidirectional) on-chip serial port provides direct communication 
with serial devices such as codecs, serial A/D (analog to digital) converters, 
and other serial systems. The interface signals are compatible with codecs 
and many other serial devices. The serial port may also be used for intercom- 
munication between processors in multiprocessing applications (the TDM port 
is further optimized for such an application). 


Both receive and transmit operations are double-buffered on the ’C5x, thus al- 
lowing a continuous communications stream (either 8- or 16-bit data packets. 
The continuous mode provides operation that once initiated requires no further 
frame synchronization pulses when transmitting at maximum packet frequen- 
cy. The serial port is fully static and thus will function at arbitrarily low clocking 
frequencies. The maximum operating frequency of the serial port while using 
internal clocks is CLKOUT1/4 (5 Mbit/s at 50 ns, 7.14 Mbit/s at 35 ns). When 
the serial ports are in reset the device may be configured to shut off the serial 
port internal clocks, allowing the device to run in a lower power mode of opera- 
tion. 


5.5.1 Serial Port Operation 


Table 5-6 lists the pins used in serial port operation. Three signals are neces- 
sary to connect the transmit pins of the transmitting device with the receive 
pins of the receiving device for data transmission. The transmitted serial data 
signal (DX) sends the actual data. The transmit frame synchronization signal 
(FSX) initiates the transfer (at the beginning of the packet), and the transmit 
clock signal (CLKX) clocks the bit transfer. The corresponding pins on the re- 
ceive device are DR, FSR and CLKR, respectively. Figure 5-6 shows these 
pins for two 'C5x serial ports connected for a one-way transfer from device 0 
to device 1. 


Table 5—6. Serial Port Pins 


Transmit clock signal 
CLKR Receive clock signal 


Transmitted serial data signal 


POR Received serial data signal 


FSX Transmit frame synchronization signal 
FSR Receive framing synchronization signal 
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‘C5x Device 1 


The serial port operates through the three memory-mapped registers (SPC, 
DXR, and DRR) and two other registers (XSR and RSR) that are not accessi- 
ble but permit double-buffering capability. These five registers are listed in 
Table 5—7. 


Table 5—7. Serial Port Registers 


(oRR[atarsveregiir 
asa 


Transmit shift register 
Receive shift register oO 


Figure 5—7 shows how the pins and registers are configured on the serial port 
and how the double-buffering is implemented. 
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Figure 5—7. Serial Port Block Diagram 
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| Control 
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transfer Logic transfer 


XSR (16) | 
| (Clear) (Clear) pom 
Byte/Word Counter | (Clock)| | | |(Clock){ | Byte/Word Counter 


FSR FSX 


DR CLKR CLKX DX 


The SPC controls serial port operation; the functions of SPC bit fields are de- 
scribed in Table 5-8. Transmit data is written to the DXR, while received data 
is read from the DRR. Atransmit is executed by writing data to the DXR, which 
copies the data to the XSR when the XSR is empty (the last word has been 
serially transmitted, that is, driven on the DX pin). The XSR manages the shift- 
ing of the data to the DX pin, thus allowing another write to DXR as soon as 
the DXR-to-XSR copy is completed. 


Upon completion of the DXR-to-XSR copy, a 0-to-1 transition occurs on the 
transmit ready XRDY bit in the SPC and generates a serial port transmit inter- 
rupt (XINT — see subsection 5.1.2 for more information on ’C5x interrupts) 
that signals that DXR is ready for a new word. The process is similar on the 
receive side. Data from the DR pin is shifted into the RSR, which copies it to 
the data receive register (DRR) from which it may be read. Upon completion 
of the RSR-to-DRR copy, a 0-to-1 transition occurs on the receive ready 
(RRDY) bit in the SPC and generates a serial port receive interrupt (RINT). 
Thus, the serial port is double-buffered because data can be transferred to or 
from DXR or DRR while another transmit or receive is being performed. Note 
that the transfer timing is synchronized by the frame sync pulse in burst mode 
and is discussed in more detail in subsection 5.5.2. 
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Figure 5—8 shows the 16-bit memory-mapped register that configures the seri- 
al port. Some of the bits are read-only while others are read/write. 


a 5—8. Serial Port Control Register 


Felon [reno oREWrTV[ var] mor [ws [nero or] wou] Fou ro] oa] Res 


R/W R/W R R R/W R/W R/W R/W RW RWW RW R 
Note: R = Read, W = Write 


Table 5—8. Serial Port Control Register Bits Summary 


The Digital Loopback Mode Bit can be used to put the serial port in digital loopback mode. When DLB=1, 
DR and FSR are connected to DX and FSX, respectively, through multiplexers, as shown in 
Figure 5—9(a) and Figure 5—9(b). Additionally, CLKR is driven by CLKX if MCM=1. If DLB=1 and 
MCM=0, CLKR is taken from the CLKR pin of the device. This configuration allows CLKX and CLKR 
to be tied together externally and supplied by a common external clock source. The logic diagram for 
CLKR is shown in Figure 5—9(c). If DLB=0, DR, FSR, and CLKR are taken from the respective device 
pins. Note that TXM must be set to one for proper operation in DLB mode. Note also that the FSX and 
DX signals appear on the device pins when DLB=1, but FSR and DR do not. 


The Format Bit specifies the word length of the serial port transmitter and receiver. If FO=0, data is trans- 
mitted and/or received as 16-bit words. If FO=1, data is transferred as 8-bit bytes. The data is transferred 
with the MSB first. 


The Frame Synch Mode Bit specifies whether frame synchronization pulses are required for serial port 
operation. If FSM=1, a frame sync pulse is required on FSX/FSR for the transmission/reception of each 
word. When the serial port is operated in the continuous mode, FSM=0. Refer to subsection 5.5.1 for 
more details on the frame sync signals. 


The Clock Mode Bit specifies the clock source for CLKX. If MCM=0, CLKX is taken from the CLKX pin. 
If MCM=1, CLKX is driven by an on-chip clock source having a frequency equal to one-fourth of 
CLKOUT1. Note that if MCM=1 and DLB=1, a CLKR signal is also supplied by the internal source. 


The Transmit Mode Bit configures the FSX pin as an input (TXM = 0) or as an output 
(TXM = 1). When TXM = 1, frame sync pulses are generated internally when data is transferred from 
the DXR to DSR to initiate data transfers. The internally generated framing signal is synchronous with 
respect to CLKX. When TXM = 0, the transmitter idles until a frame synch pulse is supplied on the FSX 
pin. 
The Transmit Reset and Receive Reset signals reset the transmitter and receiver, respectively. If the 
SPC is to be modified to reconfigure the serial port, a total of two writes should be made to the SPC. The 
first write should write zeroes to XRST and RRST and the desired configuration to bits 1-5. The second 
write should write ones to XRST and RRST, taking the serial port out of reset. When a zero is written.to 
either of these bits, activity in the corresponding section of the serial port halts. Note that when XRDY=0, 
writing a zero to XRST generates a transmit interrupt. When XRST=0, RRST=0, and MCM=0, the inter- 
nal clocks to the serial ports are shut off, allowing the device to run in a lower power mode of operation. 


6] XRST 
7] RRST 
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Table 5—8. Serial Port Control Register Bits Summary (Continued) 
8 
af | 
| RRDY | 
XRDY 


The Input 0 Bit and Input 1 Bit allow ihe CLKR and CLKX pins to be used as bit inputs. INO and IN1 reflect 
the current levels of the CLKR and CLKX pins, respectively, of the device. The levels on these pins can 
be read by reading the SPC. They can be tested by using the PLU or the BIT or BITT instruction. Note 
that there is a latency of between 0.5 and 1.5 CLKOUT1 cycles in length from CLKR/CLKX switching 
to the new CLKR/CLKX value being represented in the SPC. 
Receive Ready and Transmit Ready Bits. A transition from 0 to 1 of the RRDY bit indicates that the re- 
ceive shift register (RSR) has been copied to the DRR and that the data can be read. A receive interrupt 
is generated upon the transition. A transition from 0 to 1 of the XRDY bit indicates that the DXR contents | 
have been copied to the XSR and that data is ready to be loaded with a new data word. A transmit inter- 
rupt is generated upon the transition. These bits can be polled in software in lieu of using serial port inter- 
rupts. 
The Transmit Shift Register Empty Flag. This bit indicates whether the transmitter has experienced un- 
derflow. Underflow occurs when two conditions are satisfied: 1) the XSR empties, and 2) the DXR has 
not been reloaded since the last DXR-to-XSR transfer. Note that underflow does not constitute an error 
condition in burst mode. If another frame synch pulse occurs prior to writing the DXR while in burst mode, 
the previous data in the XSR is shifted out the DX pin. Writing to DXR inactivates the XSREMPTY bit. | 
XSREMPTY<=0 indicates underflow. 
The Receive Shift Register Full Flag. This bit indicates whether the receiver has experienced overrun. 
Overrun occurs when three conditions are satisfied: 1) RSR is full, 2) the DRR has not been read since 
the last RSR-to-DRR transfer, and 3) a frame sync pulse appears on FSR. Note that condition 3 applies 
only when FSM=1. When FSM=0, only the first two conditions apply. When RSRFULL=1, the receiver 
halts and waits for the DRR to be read. The data in the RSR is preserved, but any data sent on DR while 


' = 

| 13} RSRFULL 

| the receiver is halted is lost. Reading DRR, device reset, and serial port reset each clear the RSRFULL 
: | bit. RSRFULL=1 indicates overflow. 


The SOFT bit. This bit is enabled when the FREE bit is 0. If FREE=0, the SOFT bit selects immediate | 
14 | SOFT . apie 
| stop if 0, stop after word completion if 1. See page 5-23. 


p15. FREE | The FREE bit. If FREE=1, free run is selected, regardless of the value of the SOFT bit. If FREE=0, the 


SOFT bit selects the emulation mode as described above. See page 5-23. 


Bit 0 is reserved and is read as 0 (although it performs a function in the TDM 
serial port, explained in Section 5.6). The format bit FO, bit 1 of the SPC, speci- 
fies whether data is transmitted as 16-bit words (FO=0) or 8-bit bytes (FO=1). 
Note that in the latter case, only the lower byte of whatever is written to DXR 
on the transmitter is transmitted and the lower byte of whatever is read from 
DRR on the receiver is received. To transmit a whole 16-bit word in 8-bit byte 
mode on the transmitter, two writes to DXR are necessary, with the appropriate 
shifts of the value because the upper 8 bits written to DXR are ignored. Similar- 
ly, to receive a whole 16-bit word in 8-bit mode on the receiver, two reads from 
DRR are necessary, with the appropriate shifts of the value, because the upper 
8 bits in DRR are random values. 


The source device for the clock for serial port transfers is set by bit 4 (MCM) 
of the SPC register. If MCM=1, then the CLKX is configured as an output and 
is driven by an internal clock source with a frequency equal to 1/4 of CLKOUT1. 
lf MCM=0, CLKX is configured as an input and thus accepts an external clock. 
Note that the CLKR pin is always configured as an input. 
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The source device for the frame synchronization pulse is set with the TXM bit, 
(bit 3). Like MCM, if TXM=1, the FSX pin is configured as an output and drives 
a pulse at the beginning of every transmit. If TXM=0, FSX is configured as an 
input and accepts an external frame sync signal. Note that the FSR pin is al- 
ways configured as an input. 


The reset of the serial port for both transmitter and receiver is done by the 
XRST bit and the RRST bit, bits 6 and 7, respectively. These signals are active 
low, so that if XRST=RRST=0, the serial port is in reset. To modify SPC to con- 
figure the serial port, a total of two writes to the SPC are necessary. The first 
write should write zeros to the XRST and RRST and the desired configuration 
bits 1-5. While maintaining the desired configuration bits, the second write 
should write ones to XRST, and RRST, bits, taking the serial port out of reset. 
Note that these bits can be reset individually if desired. When a zero Is written 
to either of these bits, activity in the corresponding section of the serial port 
stops. When XRST=0 and RRST=0, the particular internal clocks to the serial 
port are shut off. This minimizes the switching and allows the device to operate 
on lower power consumption (as long as the CLKX bit is configured as an input 
— that is, with MCM=0). 


The FSM bit (bit 3) specifies whether frame syncs are needed in consecutive 
serial port transmits. If FSM=1, a frame sync is required for every transfer and 
the mode is referred to as burst mode, because there may be periods of inac- 
tivity on the serial port between transmits. The frequency of packet writes to 
DXR is called packet frequency. The packets can be 8 or 16 bits long, depend- 
ing on FO. 


As the packet frequency increases, it reaches a maximum that is equivalent 
to 8 or 16 clock cycles, depending on FO. Note that this cycle count corre- 
sponds to 32 or 64 instruction cycles on the CPU, again depending on FO if 
internal ’C5x clocks are used. Thus, if transmitting at maximum rate for more 
than one transmission, the frame sync signal becomes extraneous. The con- 
tinuous mode of operation (FSM=1) is then the mode that requires only an ini- 
tial frame sync pulse, as long as awrite to DXR for transmit, or aread from DRR 
for receive, is executed during each transmission. The timing of both modes 
is dicussed in detail in subsections 5.5.2 and 5.5.3. 


The DLB bit, (bit 1) is a digital loop back mode that allows testing of the serial 
port code with just one device. When DLB=1, DR and FSR are connected to 
DX and FSX, respectively, through multiplexers, as shown in Figure 5-9. 
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Figure 5—9. Receiver Signal MUXes 
(a) (b) 


DR FSR 
DR (internal) FSR (internal) 


DX | FSX 


DLB oe DLB 


CLKX 


CLKR is driven by CLKX if MCM=1. But if MCM=0 while DLB=1, then CLKR 
is taken from the CLKR pin. This allows for external clock generation of these 
signals during digital loopback mode. If DLB=0, then normal operation occurs 
where DR, FSR, and CLKR are all taken from their respective pins. 


Bits 10—13 in the SPC are read-only status bits that indicate various states in 
serial port operation. Writes and reads to the serial port may be synchronized 
by polling RRDY and XRDY, (bits 10 and 11, respectively) or by using the inter- 
rupts that they generate. A transition from 0 to 1 of the RRDY bit indicates that 
the RSR has been copied to the DRR and that the received data may be read. 
A receive interrupt (RINT) is generated upon this transition. A transition from 
Oto 1 of the XRDY bit indicates that the DXR contents have been copied to the 
XSR and that DXR is ready to be loaded with a new data word. A transmit inter- 
rupt (XINT) is generated upon this transition. Polling these bits in software may 
either substitute for or complement the use of serial port interrupts. In other 
words, both polling and interrupts can be used together if so desired. The 
XSREMPTY bit (bit 12) indicates whether the transmitter has experienced un- 
derflow. (When XSREMPTYS0, it is active). 


The following three situations cause the XSREMPTY flag to become active: 


DXR has not been loaded since the last DXR—XSR transfer 


AND XSR empties (The actual transition of XSREMPTY occurs after the 
last bit has been shifted out of XSR) 


OR serial port reset (XRST=0) 
OR device reset 
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When XSREMPTY is active, the transmit side of the serial port halts, thus driv- 
ing no value (the DX pin is in a high-impedance state). An exception occurs 
in burst mode with external frame syncs, which is explained in subsection 
5.5.4. Note that underflow does not constitute an error condition in the burst 
mode, although it does in the continuous mode (error conditions are further 
discussed in subsection 5.5.4). The XSREMPTY flag becomes inactive 
(XSREMPTY=1) when: 


A write to DXR occurs. Note that more information on the transmit timing 
is explained in subsection 5.5.2. 


The RSRFULL bit, (bit 13) indicates whether the receiver has experienced 
overrun (When RSRFULL=1, it is active). 


Overrun occurs when: 


The DRR has not been read since the last RSR-to-DRR transfer. 
AND RSR is full. 
AND a frame sync pulse appears on FSR. 


Note that in continuous mode (FSM=0), only the first two conditions apply; 
therefore, RSRFULL transitions after the last bit has been shifted out. When 
RSRFULL=1, the receiver halts and waits for DRR to be read. Thedatain RSR 
is preserved, but any new data driven on the DR pin while the receiver is halted 
is lost. 


The RSRFULL flag becomes inactive (RSRFULL=0) under the following three 
conditions: 


DRR is read 
OR serial port is reset (RRST=0) 
OR device is reset 


INO and IN1 (bits 8 and 9) inthe SPC allow the CLKR and CLKX pins to be used 
as bit inputs. INO and1N1 reflect the current levels of the CLKR and CLKX pins. 
The levels on the pins can be read by reading the SPC. They can be tested 
by using the PLU or BIT or BITT instructions. Note that there is a latency of 
between 0.5 and 1.5 CLKOUT1 cycles in length from CLKR/CLKX switching 
to the new CLKR/CLKX value being represented in the SPC. Note that if the 
serial port is put into reset, INO and IN1 can be used as bit inputs and DRR and 
DXR as general-purpose registers. SOFT and FREE (bits 14 and 15) are spe- 
cial emulation bits that determine the state of the serial port clock when a 
breakpoint is encountered in the high-level language debugger. If the FREE 
bit (bit 15) is set to one, then upon a software breakpoint, the clock continues 
to run (that is, free runs) and data is shifted out. In this case, SOFT (bit 14) is 
a don't care. But if FREE is 0, then SOFT takes effect. If SOFT=0, then the 
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clock immediately stops, thus aborting any transmission. If the SOFT bit is 1, 
the particular transmission continues until completion of the word, and then the 
clock halts. The options are as follows: 


FREE SOFT 

1 X Free run 

0 0 Immediate stop 

0 1 Stop after completion of word 


The receive side functions in a similar fashion. Note that if an option besides 
immediate stop is chosen, the receiver continues running and an overflow er- 
ror is possible. The default value for these bits is immediate stop. 


5.5.2 Transmit and Receive Operations (Burst Mode) 


In burst mode operation, there are periods of serial port inactivity between 
packet transmits. The data packet is marked by the frame sync pulse on FSX. 
On the transmit device, the transmission is initiated by a write to DXR. The val- 
ue in DXR is shifted to XSR; upon a frame sync pulse on FSX (generated inter- 
nally or externally depending on TXM), the value in XSR is shifted out and driv- 
en on the DX pin. If DXR is reloaded before the old DXR contents have been 
transferred to XSR, the old DXR contents are overwritten. The DXR is copied 
to the XSR only if the XSR is empty and the DXR has been loaded since the 
last DXR to XSR transfer. The DXR should be written to only if XRDY=1, which 
is guaranteed if the DXR write is made in response to a transmit interrupt or 
polling XRDY. The timing for the serial port transmit is shown in Figure 5—10. 


Figure 5—10. Burst-Mode Serial Port Transmit Operation 
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DXR XSR 
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Note in the following discussion that the timings are slightly different for inter- 
nally (TXM=1, FSX is an output) and externally (TXM=0, FSX is an input) gen- 
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erated frame syncs. This distinction is made because in the former case, the 
frame sync pulse is generated by the transmitting device as a direct result of 
a write to DXR. In the latter case, there is no such direct effect. Instead, the 
transmitting device must write to DXR and wait for an externally generated 
frame sync. 


If frame sync pulses are internally generated (TXM=1), then after a write to 
DXR, a frame sync pulse is generated on the next rising edge of CLKX (For 
externally generated frame syncs the following events will occur whenever the 
frame sync pulse appears by the rising edge of CLKX after a write to DXR). 
Then on the next falling edge of CLKX, XSR is loaded with the value from DXR, 
and XRDY goes high, generating a transmit interrupt (XINT). On the next rising 
edge of the CLKX cycle, the first data bit (MSB first) is driven on the DX pin. 
With the fall of the frame sync pulse, the rest of the bits will be shifted out. 
(Therefore, the first bit could have variable length if the frame sync is gener- 
ated externally and does not fall within one CLKX cycle. Internally generated 
frame syncs are guaranteed by ’C5x timings). 


When all the bits are transferred, the DX pin enters the high-impedance state. 
Note that if DXR had not been loaded when XINT was generated, the 
XSREMPTY flag would become active (go low), indicating underflow. Thus, 
there is a 2-CLKX cycle latency (approximately) after DXR is loaded, before 
the data is driven on the line, assuming that the frame sync pulse is generated 
internally (TXM=1). If the pulse is externally generated, this latency does not 
exist, and the timing specifications are relaxed. With externally generated 
frame sync, if the XSREMPTY flag is active and a frame sync pulse is gener- 
ated, any old data in the DXR is transmitted. This is explained in detail in sub- 
section 5.5.4. 


Figure 5—11. Burst-Mode Serial Port Receive Operation 
| | | | | | | | | | | | | | 
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The shifting into RSR begins on the falling edge of the CLKR cycle after the 
frame sync has gone low. After all the bits have been received, the contents 
of the RSR are transferred to the DRR on the falling edge of CLKR and RRDY 
goes high, generating a receive interrupt (RINT), as shown in Figure 5—11. 
Note that if the DRR from the previous receive had not been read and a frame 
sync appears, the RSRFULL flag would go high. This condition is an actual er- 
ror and introduces questions of the serial port’s behavior under various error 
situations: for example, the appearance of frame sync during a receive. Vari- 
ous error situations are discussed in subsection 5.5.4. 


Note that if the packet frequency is increased, the inactivity period between the 
data packets for adjacent transfers decreases to zero. This corresponds to a 
minimum period between frame sync pulses (equivalent to 8 or 16 CLKX/R 
cycles, depending on FO) that corresponds to a maximum packet frequency 
at which the serial port may operate. At maximum packet frequency in 
Figure 5—12, the timing looks like a compressed version of Figure 5—10. 


Figure 5—12. Burst-Mode Serial Port Transmit at Maximum Packet Frequency 
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The data bits in consecutive packets are transmitted continuously with no inac- 
tivity in between the bits. The frame sync pulse overlaps the last bit transmitted 
in the previous packet. The receive side in Figure 5—13 looks similar. 
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Figure 5-13. Burst-Mode Serial Port Receive at Maximum Packet-Frequency 


8 NIV ININ NINN INI NPA DPN. 
| 


| | | | | | | | | | | | | 
FSR 
: | | | | | | | | | | | | | 
DR | | : | | | | a | 
cae C2) C2 C3 <) C).€2 €) 2 «) «<3 6 CoC 
| | | MSB | | | | | | LSB | | | 
RINT | | | | | | | | | | | | | | 
f° t 
DRR DRR 
Loaded Loaded 
From RSR From RSR 


The maximum packet frequency transfer looks like a compressed version of 
burst mode with no periods of inactivity. The frame sync pulse overlaps the first 
bit transmitted. 


Figure 5—12 and Figure 5—13 show the transfer of multiple data packets at 
maximum packet frequency; the frame sync appears to be extraneous in- 
formation. Since the data packets are transmitted at a constant rate, the CLK 
provides enough timing information for the transfer and permits a continuous 
stream of data. Theoretically, only an initial frame sync signal is needed to initi- 
ate the multipacket transfer. This continuous mode is supported by the ’C5x 
serial port and is discussed in subsection 5.5.3. 


Figure 5-14. Burst-Mode Serial Transmit Operation With Delayed Frame Sync in External 
Frame Sync Mode 
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The operation of the serial port with external frame sync is similar to that with 
internal frame sync. Events occur when the external frame sync appears. 
When the external frame sync is delayed, however, the double buffer is filled 
and frozen until the delayed frame sync appears, as shown in Figure 5—14. 
When the delayed frame sync occurs, A is transmitted on DX; after the trans- 
mit, a DXR-to-XSR copy of B occurs, and XINT is generated. The next frame 
sync after the delayed frame sync causes B to be transmitted on DX. Note than 
when the loading of B into DXR occurs, a DXR-to-XSR copy of B does not oc- 
cur, and XINT is not generated because A has not been transmitted on DX. Any 
subsequent writes to DXR before the delayed frame sync occurs would over- 
write DXR. 


5.5.3 Transmit and Receive Operations (Continuous Mode) 


In the continuous mode, the frame sync signal on FSX/FSR is not necessary 
for consecutive packet transfers at maximum packet frequency after the initial 
pulse. Continuous mode is selected by setting FSM=0. Upon the first store to 
DXR in continuous mode, a frame sync is generated for the first transmission 
and then no more. As long as DXR is updated once every transmission, the 
continuous mode continues. Failing to update causes the serial port to halt, as 
in the burst mode case (The XSREMPTY flag becomes asserted etc.). If DXR 
is written to after the halt, the device restarts the continuous mode transmit and 
generates an FSX, assuming that the frame sync is internally generated. This 
distinction that occurs between transmits using internal and external frame 
syncs is similar to the one discussed in subsection 5.5.2. 


If the frame syncs are externally generated (TXM=0), then DXR should be 
loaded, and the appearance of an external frame sync on the FSX pin restarts 
anew continuous mode transmit. If the DXR has not been updated with exter- 
nal frame sync, the DX pin remains in the high-impedance state. This is differ- 
ent from the burst mode operation and is covered in detail in subsection 5.5.4. 
The continuous mode may be discontinued — in other words changed to burst 
mode — only by a serial port or device reset. Changing the FSM bit during 
transmit or halt is not guaranteed to switch to burst mode. 


The transmit timing in continuous mode is shown in Figure 5—15. 
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Figure 5—15. Serial Port Transmit Continuous Operation 
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Transmit timing in continuous mode is similar to the continuous stream in 
Figure 5—12. The major difference is the lack of a frame sync pulse after the 
initial one. As long as DXR is updated once per transmission, this mode will 
continue. Overwrites to DXR behave just as in burst mode. The data written 
last will be transmitted. XSR operation is not disturbed. An external FSX pulse 
on the line will abort the present transmission, cause one data packet to be 
lost, and initiate a new continuous mode transmit. This is explained in more 
detail in subsection 5.5.4. 


The receive operation is similar to the transmit operation. After the initial frame 
sync pulse on FSR, no more frame syncs are needed. This mode will continue 
as long as DRR is read every transmission. If it is not read, the serial port re- 
ceive will halt (RSRFULL flag becomes active). Reading DRR will restart the 
continuous mode as soon as a frame sync is received. The continuous mode 
must be discontinued with a serial port or device reset. The receive timing can 
be seen in Figure 5—16. 
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Figure 5-16. Serial Port Receive Continuous Operation 
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Figure 5-16 shows no frame signals; otherwise, it is similar to Figure 5—13. If 
a pulse occurs on FSR during transmission (an error), then the receive opera- 
tion is aborted, one packet is lost, and a new receive cycle is begun. This is 
discussed in more detail on page 5-22. 


5.5.4 Error Conditions 


Error conditions result from an unprogrammed event occurring to the serial 
port. These conditions are operational aberrations such as overrun, underflow, 
or a frame sync pulse during a transmission. You may need to understand how 
the serial port handles these errors and the state it acquires during these error 
conditions. Because they differ slightly in burst and continuous modes, the er- 
ror conditions are discussed separately. 


In burst mode, the first error condition (discussed in subsection 5.5.1) is the 
RSRFULL flag. Basically, this flag occurs when the device has not read incom- 
ing data and more data is being sent, which is indicated by a frame sync pulse 
on FSR. The processor halts serial port receives until DRR is read. Thus, any 
further data sent is lost. If receive errors continue, and the frame sync occurs 
during a receive (that is, data is being shifted into RSR from DR pin), then the 
present receive is aborted and a new one begins. Thus, the data that was be- 
ing loaded into RSR is lost, but the data in DRR is not. No RSR-to-DRR copy 
occurs. Figure 5-17 shows the serial port receive side behavior for a frame 
sync pulse during a receive and includes nonerror situations. 
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Figure 5—17. Receive Error (Normal or Burst Mode) 
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Transmit errors in burst mode result when a frame sync occurs during various 
conditions. Underrun in burst mode is not considered an error but is explained 
in subsection 5.5.1. If atransmit is in progress (thatis, XSR data is being driven 
on the DX pin) when the frame sync pulse occurs, then the present transmit 
is aborted, and data in the XSR is lost. Then, whatever data is in the DXR at 
the time of the frame sync pulse is transferred to XSR (DXR-to-XSR copy) for 
transmitting. However, a transmit interrupt XINT is generated only if the DXR 
has been written to after the last transmit. Also, if XSREMPTY is active and a 
frame sync pulse appears, the old data in DXR is shifted out. Figure 5—18 sum- 
marizes serial port transmit behavior with error (and nonerror) conditions. 


Figure 5—18. Transmit Error (Normal or Burst Mode) 
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In continuous mode, errors take on a broader meaning. Data transfer is sup- 
posed to be occuring at all times in continuous mode. Thus, underflow 
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(XSREMPTY=0) is considered an error in continuous mode because data is 
not being transmitted. As in burst mode, overrun is an error, and both of these 
cause the Serial port receive or transmit sections to halt. The operation of both 
these flags is explained in subsection 5.5.1 in the XSREMPTY and RSRFULL 
flags description. Underflow and overrun errors are not fatal; they can be cor- 
rected by reading DRR or writing to DXR. In a write to DXR to deactivate 
XSREMPTY, either a frame sync pulse is generated (if FSM=1) or required (if 
FSM=0). On the receive side, however, after DRR is read to deactivate 
RSRFULL, a frame sync pulse is not required. The receive side of the serial 
port keeps track of the word (either 8- or 16-bit) boundary, even though itis not 
receiving data. When the RSRFULL flag is deactivated by a read from DRR, 
the receiver begins the read from the correct bit. 


Another cause for error is the appearance of frame syncs during a transmis- 
sion. After the initial frame sync in continuous mode, no others should occur. 
When a frame sync pulse occurs during a transmit, the current transmit opera- 
tion (that is, serially driving XSR data onto DX pin) is aborted, and data in XSR 
is lost. A new transmit cycle is initiated, as long as the DXR is updated once 
per transmission afterward. During a receive in continuous mode, the situation 
is similar: if a frame sync pulse occurs, one packet of data (8-bit byte or 16-bit 
word, depending on FQ) is lost. The RSR bit counter is reset, so the data that 
was being shifted into RSR from the DR pin is lost. Data then driven on DR is 
shifted into RSR. Therefore, the frame sync during transmission chart for con- 
tinuous mode looks like the left half of the burst mode charts in Figure 5—17 
and Figure 5-18 because a receive or transmit is always in progress. 


Figure 5—19 and Figure 5—20 show receive and transmit errors for continuous 
mode. Note that if a frame sync occurs after deactivating the RSRFULL flag 
by reading DRR but before the beginning of the next word (either 8- or 16-bit) 
boundary, a receive abort condition occurs. Also, note a major difference in the 
transmit continuous mode error compared with transmit burst mode error. If 
XSREMPTY is active in continuous mode and an external frame sync occurs, 
no old data is transmitted. Instead, since underflow in continuous mode is con- 
sidered an error, the frame sync pulse is ignored, and the DX pin remains in 
the high-impedance state. 
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Figure 5-19. Receive Error (Continuous Mode) 
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Figure 5—20. Transmit Error (Continuous Mode) 
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5.5.5 Example 


The code example that follows shows a one-way transmit from device 0 to de- 
vice 1 of an arithmetic sequence of numbers. The numbers are written in each 
device in a block from 9000h to bOOOh in data memory. Device 0 waits ina BIO 
loop for a ready to receive signal (XF) from device 1 and initializes the transfer 
with a value of zero. Only its transmit interrupt is enabled; its transmit ISR 
writes the value it will send into its own memory. 
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* Device 0 — Transmit side 


SPLK #0038h, SPC 
SPLK #00F8h, SPC 
SPLK #0ffffh, IFR 
SPLK #020h, IMR 
CLRC INTM 

ILOOP BCND SENDZ, BIO 
B ILOOP 

SENDZ LACL #0 
LAR AR7, #9000h 
SACL * 
SACL DXR 

SELF1 B SELF1 

XMT ISR LACC. AR7 
SUB #0b000h 
BCND END _SERP,GEQ 
LACL *+ 
ADD so #1 
SACL * 
SACL DXR 
RETE 

END SERP B END_SERP 


Serial Port 


;setup SPC as CLK source 
;and internal frame sync 
;Set TXM=MCM=FSM=1, 

> TDM=DLB=FO=0. 

;And put SP into reset 

; (XRST=RRST=0) 

Take SP out of reset 


;Setup interrupts 
*clear IFR 

;Turn on XINT 

;enable interrupts 
;Wait to for ready—to— 
sreceive from other device 
*First transmit/write 
‘value is 0 

;Setup where to write 
*Write first value 
‘Transmit first value 
;Wait for interrupts 
*Check if past 0x0b000 
*i.e. end of block 

7Go to tight loop if so 
eAdd one and transmit 
*Load value 

>Add one 

-Write value 

*Transmit value 


;Sit in tight loop after 
;block is complete. 


The code in device 1 follows. It sends a ready-to-recieve signal (XF) to device 
0. Only its receive interrupt is masked and its receive ISR reads from the DRR, 
writes to the block, and checks to see if it has reached the end of the block. 


Device 1 — Receive 


SPLK #0008h, SPC 


SPLK #00C8h, SPC 


SPLK #0ffffh, IFR 
SPLK #010h, IMR 


Set SP as CLK, frame 

;sync receive 

Set TXM=MCM=DLB=FO=0, 
*>FSM=1. 

;And put SP into reset 
; (XRST=RRST=0) 


;Take SP out of reset 


;Setup interrupts 
clear IFR 
‘Turn on RINT 
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CLRC INTM ;Enable interrupts 
LAR AR7, #9000h ;Setup where to write 
sreceived data 

CLRC XF ;Signal ready to receive 
SELF 1 B SELF1 ;Wait for interrupts 
RCV_ISR 

LACL DRR sLoad received value 

SACL *+ ;Write to memory block 

LACC AR7 ;Check if past 0x9000 

SUB #0b000h si.e. end of block 

BCND END SERP, GEQ *Go to tight loop if so 
END _SERP B END_SERP ;Sit in tight loop after 


sblock is complete. 
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5.6 TDM Serial Port 


The 'C5x devices have a TDM (time-division-multiplexed) serial port that al- 
lows the device to communicate serially with up to seven other ’C5x devices. 
The TDM port provides a simple and efficient interface for multiprocessing 
applications. 


The TDM serial port is a superset of the serial port described in Section 5.5. 
By means of the TDM bit in the TSPC control register, the port can be confi- 
gured in multiprocessing mode (TDM=1) or stand-alone mode (TDM=0). 
When in stand-alone mode, the port operates as described in Section 5.5. 
When in multiprocessing mode, the port behaves as described in this section. 
The port can be shut down for low power consumption viathe XRST and RRST 
bits as described in Section 5.5. 


Time-Division Multiplexing 


Time-division multiplexing is the division of time intervals into a number of sub- 
intervals, with each subinterval representing a communications channel ac- 
cording to a prespecified arrangement. Figure 5—21 shows a 4-channel TDM 
scheme. Note thatthe first time slot is labeled chan 1 (channel 1), the next chan 
2 (channel 2), etc. Channel 1 is active during the first communications period 
and during every fourth period thereafter. The remaining 3 channels are inter- 
leaved in time with channel 1, as shown in the figure. 


The 'C5x TDM port supports eight TDM channels. You can independently 
specify which device is to transmit and which device or devices are to receive 
for each channel. This results in a high degree of flexibility in interprocessor 
communications. 


Figure 5-21. Time-Division Multiplexing 
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5.6.2 TDM Port Operation 


Figure 5—22(a) shows the ’C5x TDM port architecture. Up to eight devices can 
be placed on the four-wire serial bus. This four-wire bus consists of a conven- 
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tional serial port’s bus of clock, frame, and data (TCLK, TFRM, and TDAT) 
wires plus an additional wire (TADD) that carries the device addressing in- 
formation. The TADD line, which is driven by a particular device for a particular 
time slot, determines which devices in the TDM configuration can execute a 
valid TDM receive on that time slot. This is similar to a valid serial port read 
operation described in Section 5.5, except that the corresponding TDM regis- 
ters are named differently. The TDM receive register is TRCV, and the TDM 
receive shift register is TRSR. The actual data is transmitted on the bidirection- 
al TDAT line. 


Note in Figure 5—22(b) that the device TDX and TDR pins are tied together ex- 
ternally to form the TDAT line. Also note that only one device can drive the data 
and address line (TDAT and TADD) in a particular slot. Meanwhile, in that par- 
ticular slot, all the devices (including the one driving that slot) sample the TDAT 
and TADD lines to see if the data is a TDM valid read. This is discussed in detail 
later in this section. In a valid TDM read, the value is transferred from the TRSR 
register to the TRCV register, and a receive interrupt is generated, indicating 
that the TRCV has valid receive data and can be read. 


All TDM port operations are synchronized by the TCLK and TFRM lines, which 
are generated by one device each (typically the same device), referred to as 
the TCLK and TFRM sources. The word master is not used here because it 
implies that one device controls the other. This is not the case, and you must 
set TCSR to prevent slot contention. Consequently, the remaining devices in 
the TDM configuration use these lines as inputs. Figure 5—22(b) shows 
TCLKX and TCLKR are externally tied together to form the TCLK line. Also, 
TFRM and TADD originate from the TFSX and TFSR pins respectively. The 
reason for this is to make the TDM serial port easy to use in standalone mode. 
The TDM port operation is controlled by several memory-mapped registers. 
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Figure 5—22. TDM Four-Wire Bus 
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Each device has six memory-mapped registers associated with the TDM serial 
port. The layout of these registers is shown in Figure 5-23. The TRCV and 
TDXR registers have the same functions as the DRR and DXR registers re- 
spectively, described in Section 5.5. The TSPC register is identical to the SPC 
register except that bit 0 is not reserved in TSPC. See subsection 5.5.1 for its 
operation. This bit (TDM) configures the port in stand-alone mode (TDM=0 — 
In this mode the TDM serial port operates like the standard serial port de- 
scribed in Section 5.5) or in multiprocessor mode (TDM=1). 


Bits DLB and FO in the TSPC are hard-configured when the port is in multipro- 
cessor mode. These bits are set to zero when TDM=1, resulting in no access 
to the digital loopback mode and in a fixed word length of 16 bits (A different 
type of loopback is covered in the example in subsection 5.6.5). The value of 
FSM does not affect the port when TDM=1. Also, when TDM=1 the underflow 
and overrun flags are not operational (Subection 5.6.4 explains how these er- 
rors are treated in TDM mode). If TDM=1, changes made to the contents of the 
TSPC become effective upon completion of channel 7 of the current frame. 
Thus the TSPC value cannot be changed for athe current frame. Any changes 
take effect on the next frame. 
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Figure 5-23. TDM Port Registers 
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The source device for the timing signals TFRM and TCLK is set by MCM and 
TXM, respectively. The TCLK source device is identified by setting the TXM 
bit of its TSPC register to one. Typically, this device is the same one that sup- 
plies the TDM port clock signal TCLK. TCLKX pin is configured as an input if 
MCM=0 and an output if MCM=1. In the latter (internal 'C5x clock) case, the 
device whose MCM=1 supplies the clock (TCLK frequency=one fourth of 
CLKOUT1 frequency) for all devices on the TDM bus. The clock can be sup- 
plied by an external source if MCM=0 for all devices. TFRM can also be sup- 
plied externally if TXM=0. An external TFRM must meet TDM receive timing 
specifications with repect to TCLK for proper operation. No more than one de- 
vice should have MCM or TXM set to one at any given time. The specification 
of which device is to supply clock and framing signals is typically made only 
once, during system initialization. 


The TDM channel select register (TCSR) of a given device specifies in which 
time slot(s) that device is to transmit. A 1 in bits 0-7 of the TCSR sets the trans- 
mitter active during the corresponding time slot. A key system-level constraint 
repeated here is that no more that one device can transmit during the same 
time slot. The devices do not check for bus contention. You must assign the 
slots consistently. As in TSPC operation, a write to TCSR during a particular 
frame is valid only during the next frame. However, a given device can transmit 
in more than one slot. This is discussed in more detail in subsection 5.6.3, with 
an emphasis on the utilization of TRTA, TDXR, and TCSR in this respect. 


The TDM receive/transmit address register (TRTA) of a given device specifies 
two key pieces of information. The lower half specifies the receive address of 
the device, while the upper half of TRTA specifies the transmit address. The 
receive address is the 8-bit value that a device compares to the 8-bit value it 
samples on the TADD line in a particular slot to determine whether it should 
execute a valid TDM receive. The receive address establishes the slots in 
which that device may receive. This process occurs on each device during ev- 
ery slot. The transmit address corresponds to what a device drives on the 
TADD line during a transmit operation on an assigned slot. The transmit ad- 
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dress establishes which receiving devices may execute a valid TDM receive 
on the driven data. 


Only one device at a time can drive a transmit address on TADD. Each proces- 
sor bitwise-logical-ANDs the value it samples on the TADD line with its receive 
address. If this operation results in a nonzero value, then a valid TDM receive 
is executed. Thus, for one device to transmit to another, there must be at least 
one bit in the upper half of the first device’s TRTA (the transmit address) with 
a value of 1 that matches one bit with a value of 1 in the lowe: “alf of TRTA (the 
receive address) of the second device. This method of configuration of TRTA 
allows the transmitting device to control which devices receive, without having 
to change the receive address on any of the devices. 


The TDM receive address register (TRAD) holds various information on the 
status of the TADD line, which can be polled to verify the integrity of this line 
and to verify the relationship between instruction cycle and TDM port timing. 
Bits 13—11 (xa—Xo) hold the current slot number value, whether a valid data re- 
ceive was executed or not. This value is latched at the begininng of the slot and 
latched only until the end of the slot. Bits 10—8 (So—Sp) hold the number of the 
last slot plus one (modulo 8) in which data was received. This value is latched 
at the end of the slot in which a valid data receive occurred during the TDM 
receive interrupt (TRNT), and maintained until the end of the next slot that is 
a valid receive. Bits 7-0 (a7—a9) hold the last value sampled on the TADD line, 
whether a valid data receive was executed or not. This value is latched half- 
way through the slot (So the value on the TADD may be shifted in) and main- 
tained until half-way through the next slot, whether a valid receive is executed 
or not. 


5.6.3 Transmit and Receive Operations (TDM Mode) 


Figure 5—16 shows the timing for the TDM port transfers. The TCLK and TFRM 
signals are generated by the timing source device. The TCLK frequency is one 
fourth the frequency of CLKOUT1 if generated by a ’C5x device. The TFRM 
pulse occurs every 128 TCLK cycles. This allows 16 data bits for each of 8 time 
slots to be driven on the TDAT line. This also permits the processor to execute 
amaximum of 64 instructions between each slot, assuming that a’C5x internal 
Clock is used. Beginning with slot 0 and with the MSB first, the transmitter 
drives 16 data bits for each slot, with each bit having a duration of 1 TCLK cycle 
(the exception is the first bit of each slot, as noted below). The data is driven 
onto the TDAT line on the rising edge of TCLK and read on the falling edge. 
Meanwhile, the transmitter also drives the TADD line with its transmit address. 
This information, unlike that on TDAT, is only one byte long and is transmitted 
with the LSB first for the first half of the slot. During the second half of the slot 
(that is, the last eight TCLK periods) the TADD line is driven high. The TDM 
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receive logic samples the TADD line only for the first eight TCLK periods, ignor- 
ing it during the second half of the slot. Therefore, the transmitting device (if 
not a ’C5x) may choose to drive TADD high or low during that time period. 


Figure 5-24. Serial Port Timing in TDM Mode 
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If none of the devices on the TDM bus are configured to transmit in a slot (that 
is, none of the devices have a 1 for the corresponding slot in their TCSR regis- 
ter), that slot qualifies as an empty slot. In an empty slot, both TADD and TDAT 
will be high impedance. This has the potential for spurious receives because 
the device actually samples TDAT and TADD for every slot and determines a 
valid TDM receive if its receive address matches the receive address on the 
TADD line. To avoid spurious reads, a pull-down 1-kQ resistor must be tied to 
the TADD line. This causes the TADD line to read low on empty slots. Other- 
wise, any noise on the TADD line that happens to match a particular receive 
address would result in a spurious read. If power dissipation is a concern and 
the resistor is not desired, then an arbitrary processor with transmit address 
equal to Oh can drive empty slots by writing to TDXR in those slots. Slot manip- 
ulation is explained later in this section. The 1-kQ resistor is not needed in the 
TDAT line. 


TDAT 


An empty slot is defined by the following two cases: the first obvious case oc- 
curs when no device has its TCSR configured to transmit in that slot. A second 
more subtle case occurs when TDXR has not been written to before a slot. This 
may happen when TCSR contents are changed because they are not sampled 
until the TFRM pulse occurs. Therefore, any subsequent change takes effect 
only on the next frame. The same is true for the receive address (the lower half 
of TRTA). But the transmit address (upper half of TRTA) and the TDXR (ob- 
viously) may be changed for the current frame for a particular slot, assuming 
that slot has not yet been reached when the instruction is executed. 


Note that the transmit address does not need to be written every time a write 
to TDXR is executed. During a write to TDXR, whatever value is in the TRTA 
is transmitted. You can test the current slot by examining TRAD while using 
the XRDY flag or transmit interrupt. This flexibility affords TDM slot manipula- 
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tion and even slot sharing if you so desire. The key is to understand the timing 
relationship between the instructions being executed and the frame/slots of 
the TDM port. Simply stated, the TCSR and the receive address (lower half of 
TRTA) take effect only at the start of a new frame, while the transmit address 
(upper half of TRTA) and TDXR (transmit data) can take effect at the start of 
a new slot. 


When changing a transmit address on the fly, be careful not to corrupt the re- 
ceive address; both are located in the same register TRTA. Thus, this scheme 
follows the philosophy of allowing the transmitting device to set which devices 
can receive. Regarding empty slots, note that in a TDM port the frame sync 
on TFRM is being transmitted at all times, not just when there is a write to 
TDXR. Thus, if a device does not happen to write to TDXR during its selected 
slots (by TCSR), it will have an empty slot that shows up as high impedance 
on the TDAT and TADD lines. 


As a final note on timing, the duration of the first bit (bit 15 TDAT and bit 0 of 
TADD) of each slot is only half the normal duration. Also, the TFRM overlaps 
bit O of time slot 7. Refer to the timing diagrams in Appendix A. 


5.6.4 TDM Error Conditions 


Due to time slots and the ability for one processor to transmit in multiple slots, 
the concept of overflow and underrun becomes unclear. Thus, the overrun and 
underflow flags are not enabled in the TDM port in TDM mode. On the receive 
side, if DRR has not been read and a valid receive operation is initiated (due 
to the value on TRTA and the device’s receive address), the present value of 
DRR is overwritten. Thus, the TDM port is not halted. On the other hand, dur- 
ing atransmit if DXR has not been updated, nothing will be driven on the TADD 
or TDAT lines. The pins will be in high impedance. This mode of operation pre- 
vents spurious transmits from occurring. 


lf TFRM pulses occur during a nonregular time in transmission, the TDM port 
fails. In other words, only one TFRM should occur every 128 TCLK cycles. Un- 
like the serial port, the TDM port cannot be reinitialized with a frame sync pulse 
during transmission. 


5.6.5 Example of TDM Operation 


Table 5—9 shows the data represented by the TADD signal for each of the eight 
channels, given the transmitter and receiver designations shown. This exam- 
ple shows the configuration for eight devices to communicate with each other. 
In this example, device 0 broadcasts to all device addresses. In subsequent 
frames, devices 1—7 communicate to one other processor. 
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TDM Serial Port 


Table 5-9. Interprocessor Communications Scenario 


Channel TADD Data Transmitter Receiver Device(s) 
Device 


Table 5-10 shows the TDM port register contents of each device that results 
in the scenario given in Table 5-9. Device 0 provides the clock and frame con- 
trol signals for all channels and devices. The TCSR and TRTA register con- 
tents specify which device is to transmit on a given channel and which devices 
are to receive. 


Table 5—10. TDM Register Contents 
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Se 
O102h 


wan | tah 


In Table 5-10, the transmit address of a particular device (the upper byte of 
TRTA) matches the receive address (the lower byte of TRTA) of the receiving 
device. Butit is not necessary for the transmit and receive addresses to match 
exactly. Remember that the matching operation implemented on the receive 
side is a bitwise AND. Thus, only one bit must match. The advantage of this 
scheme is that a transmitting device can select the devices to receive its data 
by changing its transmit address only. The receive address of the receiving de- 
vice does not need to be changed (assuming the receive address is unique). 
In the example, device 0 can transmit to any combination of the other devices 
by merely writing to the upper byte of TRTA. For example, if it changed its 
TRTA to 08001h on the fly, it would transmit only to device 7. A device can write 
to itself because the transmit is executed on the rising edge and the receive 
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on the falling edge of TCLK. To enable this sort of loop back, it is necessary 
to have the wired-OR pins connected (the TDAT and TCLK lines). In the exam- 
ple, if device 0 has a TRTA of 00101h, it would transmit to itself. 


In the code example below, a one-way transmit from device 0 to device 1 of 
an arithmetic sequence of numbers is shown. The numbers are written in each 
device in a block from 4000h to 6000h in data memory. Device 0 transmits on 
slot 0 and has a transmit address of Oth. It waits in a BIO loop for a ready to 
receive signal (XF) from device 1 and initializes the transfer with a value of 
zero. Only its transmit interrupt is enabled, and its transmit ISR writes the value 


it will send into its own memory. 


* Device 0 — Transmit side 


SPLK #1h, TCSR 


SPLK #100h, TRTA 


SPLK #0039h, TSPC 


SPLK #00F9h, TSPC 


SPLK #0ffffh, IFR 
SPLK #080h, IMR 


CLRC INTM 


TILOOP BCND TSENDZ, BIO 
B TILOOP 


TSENDZ LACL #0 


LAR AR7, #4000h 
* 


SACL 

SACL TDXR 
SELF2. B SELF2 
TXMT ISR 

LACC AR7 

SUB #6000h 


BCND END _ TDMP, GEQ 


LACL *+ 
ADD #1 
SACL * 
SACL TDXR 
RETE 


;Setup TCSR to xmt on 
-slot 0 
;Setup transmit address 


;set up TSPC as TCLK, TFRM 
ssource 

»Set TXM=MCM=FSM=TDM=1, 

> DLB=FO=0. 

sAnd put TDM into reset 

; (XRST=RRST=0) 

*Take TDM out of reset 


*Setup interrupts 
sclear IFR 
:;Turn on TXNT 


;enable interrupts 


;Wait for ready—to— 
sreceive from other device 


*First transmission/write 
svalue is 0. 

;Setup where to write 
‘Write first value 
‘Transmit first value 


;Wait for interrupts 


;Check if past 0x6000 
si.e. end of block 
*Go to tight loop if so. 


sAdd one and transmit 
Load value 

:;Add one 

*Write value 
*Transmit value 
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END _TDMP B 


END_TDMP 


*Sit in tight loop after 
sblock is complete. 


The code in device 1 follows. It has a receive address of 01h and sends a 
ready-to-receive signal (XF) to device 0. Only its receive interrupt is masked, 
and its receive ISR reads from the TDRR, writes to the block, and checks to 


see if it has reached the end of the block. 


*Device 1 — receive side 


SPLK 


SPLK 


SPLK 


SPLK 


SPLK 
SPLK 


CLRC 
LAR 


CLRC 
SELF2 B 


TRCV_ISR 
LACC 
SACL 
LACC 
SUB 
BCND 
RETE 


END_TDMP B 


#0h, TCSR 


#001h, TRTA 


#0009h, TSPC 


#00C9h, TSPC 


#0ffffh, IFR 
#040h, IMR 


INTM 
AR7, #4000h 


XF 
SELF2 


TRCV 
*+ 

AR7 
#6000h 


END TDMP, GEQ 


END_TDMP 


me =™e@ woe TO MO WE WE =e "06 EO 


=e ™=6 €@O 


me te TO MO 


=e 


=e =e SO 6TH 6TMO 


=e 0 


Setup TCSR to xmt on 
no slots 
Setup receive address 


Set TDM as TCLK, TFRM 

receive 

Set TXM=MCM=DLB=FO=0, 

FSM=TDM=1. 

And put TDM into reset 
(XRST=RRST=0 ) 

Take TDM out of reset 


Setup interrupts 
clear IFR 
Mask on TRNT 


enable interrupts 

Setup where to write 
received data 

Signal ready to receive 


Wait for interrupts 


Load received value 
Write to memory block 
Check if past 0x6000 
i.e. end of block 

Go to tight loop if so 


Sit in tight loop after 
block is complete. 
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5.7 Timer 


The timer is an on-chip down counter that can be used to periodically generate 
CPU interrupts. The timer is decremented by one at every CLKOUT1 cycle. 
A timer interrupt (TINT) is generated each time the counter decrements to 
zero. The timer thus provides a convenient means of performing periodic I/O 
or other functions. Figure 5-25 shows a logical block diagram of the timer. 
When the timer is stopped (TSS = 1), the internal clocks to the timer are shut 
off, allowing the device to run in a lower power mode of operation. 


Figure 5-25. Timer Block Diagram 


SRESET 


Tre 


, CLKOUT1 
| @ 
> TINT 


>. TOUT 


The timer interrupt rate is given by 


Se 
TINT rate = 7 xk UXV ~ ty X (TDDR+ 1) x (PRD +1) 


where t,(c) is the period of CLKOUT1, u is the sum of the TDDR contents (see 
Table 5-11) plus 1, and vis the sum of the PRD contents (see Figure 5—25) 
plus 1. 


Therefore, the timer interrupt rate is equal to the CLKOUT1 frequency divided 
by two independent factors. Referring to Figure 5~25, each of the two divisors 
is implemented with a down counter and period register. The counter and peri- 
od registers for the first stage are the PSC and TDDR fields of the TCR, respec- 
tively, and each is 4 bits wide. The counter and period registers for the second 
stage are the memory-mapped, 16-bit wide TIM and PRD registers. Each time 
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a counter decrements to zero, a borrow is generated on the next CLKOUT1 
cycle, and the counter is reloaded with the contents of its corresponding period 
register. The output of the second stage is the timer interrupt signal sent to the 
CPU and to the timer output pin (TOUT). The width of the borrow pulse appear- 
ing on the output of stage 2 is equal to t.(c) (see Appendix A). 


The timer operation is controlled via the timer control register (TCR). Bits 0-3 
constitute the TDDR field of the TCR. Upon reset, TDDR is set to zero. The 
timer can be stopped and restarted with the TSS bit and can be reset with the 
TRB bit. The timer is stopped by setting the TSS bit to one and restarted by 
setting the TSS bit to zero. When the timer stopped, the internal clocks are shut 
off to the timer, allowing a lower power mode of operation. Upon reset, the TSS 
bit is zero, and the timer immediately starts timing. The timer period can be re- 
loaded by setting the TRB bit to one. These bits are defined in the TCR as 
shown in Table 5—11. Bits 6-9 constitute the PSC field of the TCR. Figure 5-26 
shows the bit layout of the timer control register. 


Table 5—11. Timer Control Register 


_ Bit |_ Name |Description 
4 TSS | Stop Timer=1, Restart Timer=0 
ps | TRB | Reload Timer with Period=4 


Figure 5—26. Timer Control Register (TCR) 


5-46 


15-12 11 10 9-6 5 4 3-0 


The contents of the PRD register are loaded into the timer counter register 
(TIM) when the timer counter register decrements to zero or when the timer 
is reset by setting the TRB bit to 1. The TRB bit is always read as zero. When 
a 1is written to TRB, the timer is reset, but TRB is still read as zero. The TDDR 
(timer divide down register) is loaded by writing the appropriate divide-down 
value into the TCR. As with the TIM/PRD register pair, the value of TDDR is 
not immediately loaded into the prescaler counter (PSC). The prescaler count- 
er is loaded with the value in TDDR when it decrements to zero or when the 
timer is reset by setting the TRB bit to 1. The PSC can be read by reading the 
TCR register, but cannot be written directly via software. Bits 10 and 11 are 
special emulation bits that determine the state of the serial port clock when a 
breakpoint is encountered in the high-level language debugger. Please see 
page 5-23 for their functional description. Bits 15—12 are always read as zero. 
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The current value in the timer can be read by reading the TIM register; the pre- 
scaler counter can be read by reading the TCR. Because it takes two instruc- 
tions to read both registers, there may be a change between the two reads as 
the counter decrements. Therefore, where precise timing measurements are 
being made, it may be more accurate to stop the timer to read these two val- 
ues. The timer can be stopped by setting the TSS bit to one and restarted by 
resetting this bit to zero. 


The timer provides a convenient and efficient way to generate a sample clock 
for an analog interface. Consider the following example of using the timer to 
generate a sample rate of 50 kHz. The initialization for this example is as fol- 
lows: 


*Clkin frequency = 20 MHz, timer is running at 10 MHz. 
* 


LDP #0 

SPLK #199,PRD ;Load timer period for 20 us period. 
OPL #8,IMR -Set timer interrupt mask bit 

SPLK #20h,TCR *reload and start timer. 

SPLK #10000b,IFR ;Clear any pending timer interrupts. 
CLRC INTM ;global interrupt enable. 


* 


Consider an analog-to-digital converter operating at this sample rate. A typical 
interrupt service routine (ISR) would be as follows: 


*50 kHz sample rate A/D interrupt service routine 
* 


TIMER ISR MAR*,AR3 ;Use auxiliary register reserved for 


stimer ISR. 
IN *,14 ;Read A/D. 
RETE *>Re—enable interrupts and return. 
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5.8 Divide-by-One Clock 
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The divide-by-one clock feature on the 'C5x consists of a phase lock loop (PLL) 
peripheral, which provides the capability to supply a clock cycling at the ma- 
chine cycle rate of the CPU. This is a desirable feature because it reduces a 
system’s high-frequency noise that is due to a high-speed switching clock. 
When this peripheral feature is implemented, the external frequency source 
can be used by injecting the clock directly into CLKIN2, with X1 left uncon- 
nected and X2 connected to Vpp. The divide-by-one option is used when the 
CLKMD1 pin is strapped high and CLKMD2 is strapped low. The PLL is not 
enabled in all other clock modes, and clocks are shut off to the module to allow 
a lower power mode of operation. 


The processor generates two internal clocks, via the input clock, to the device. 
The CLKOUT1 signal indicating the CPU machine cycle rate equals the input 
clock. The PLL has a maximum operating frequency of 28.6 MHz (on a 35-ns 
‘C5x device). The PLL requires a transitory locking time of 256 cycles. See Ap- 
pendix A for more information on the external input frequency specification. 
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Chapter 6 


The total memory address range of the 'C5x devices is 224K 16-bit words. The 
memory space is divided into four specific memory segments: 64K program, 
64K local data, 32K global data, and 64K I/O port. The parallel nature of the 
architecture of the ’C5x devices allows for the device to perform three concur- 
rent memory operations in any given machine cycle: fetching an instruction, 
reading an operand, and writing an operand. The 'C5x memory configuration 
and operation are described in the following sections: 


Topic Page 


6.1 Memory Space 
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The 'C5x design is based on the enhanced Harvard architecture. This archi- 
tecture has multiple memory spaces that can be accessed on three parallel 
buses; this makes it possible to access both program and data simultaneously. 
The three parallel buses are the program read/write bus (PAB), data read bus 
(DAB1), and data write bus (DAB2). Each bus accesses different memory 
spaces for different aspects of the device operation. The ’C5x memory is orga- 
nized into four individually selectable spaces: program, local data, global data, 
and input/output ports (I/O). These spaces compose an address range of 
224K words. Within any of these spaces RAM, ROM, EPROM, EEPROM, or 
memory-mapped peripherals can reside either on- or off-chip. 


The program space contains the instructions to be executed as well as tables 
used in execution. The local data space stores data used by the instructions. 
The global data space can share data with other processors within the system 
or can serve as additional data space. The I/O space interfaces to external 
memory-mapped peripherals and can also serve as extra data storage space. 
Within a given machine cycle, the CALU can execute as many as three concur- 
rent memory operations. This chapter describes each memory space and the 
‘C5x memory map. 


The ’C5x devices include a considerable amount of on-chip memory to aid in 
system performance and integration. The ‘C50 includes 2K words of boot 
ROM, 9K words program/data single-access RAM (SARAWM) , and 1056 words 
of dual-access data RAM (DARAM). The boot ROM resides in program space 
at address 0 and includes a device test (for internal use) and boot code. The 
9K block of single-access RAM can be mapped to program and/or data space 
and resides at address 0800h in either space. The single-access RAM re- 
quires a full machine cycle to perform a read or a write. Tne dual-access RAM 
can be read from and written to in the same cycle. The 1056 words of dual-ac- 
cess RAM are configured in three blocks: block 0 (BO) is 512 words at address 
0100h—02FFh in local data memory, or OFEQOH-OFFFFh in program space; 
block 1 (B1) is 512 words at address O300h—04F Fh in local data memory; and 
block 2 (B2) is 32 words at address 060h in local data memory. 


The ’C51 removes the 2K boot ROM from program memory space. It also re- 
places 8K words of single-access program/data RAM with an 8K-word block 
of maskable ROM. The ROM is located in the address range Oh—1FFFh in pro- 
gram space. The additional 1K word of single-access RAM is mapped to data 
space (800h—OBFFh), program space (2000h—23FFh), or both spaces. The 
dual-access blocks of RAM on the 'C51 are mapped at the same addresses 
as the ’C50. 


The ’C53 has 16K words of on-chip maskable ROM and 3K words of single-ac- 
cess RAM. The ROM is located in the address range 0O-3FFFh in program 


Memory 


Figure 6-1. 'C50 Memory Map 
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space. The 3K words of single-access RAM are mapped into data space 
(800—13FFh), program space (4000-4BFFh), or both spaces. The dual-ac- 
cess RAM blocks on all ’C5x devices are mapped at the same addresses. 
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Figure 6—2. ’'C51 Memory Map 
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Figure 6-3. 'C53 Memory Map 
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6.2 Program Memory 


The external program memory space on the 'C5x devices addresses up to 64K 
16-bit words. In addition, ’'C5x devices have on-chip ROM, single-access pro- 
gram/data RAM, and dual-access RAM. Software can configure these 
memory cells to reside inside or outside of the program address map. When 
they are mapped into program space, the device automatically accesses them 
when it addresses within their bounds. When the CALU generates an address 
outside these bounds, the device automatically generates an external access. 
The advantages of operating from on-chip memory are as follows: 


1) Higher performance because no wait states are required for slower exter- 
nal memories. 

2) Lower cost than external memory. 

3) Lower power than external memory. 


The advantage of operating from off-chip memory is the ability to access a larg- 
er address space. 


Program Space Configurability 


The program memory can reside both on- and off-chip. After reset, the configu- 
ration is set by the level on the MP/MC pin. If this pin is high, the device is confi- 
gured as a microprocessor, and the on-chip ROM is not addressed. If this pin 
is low, the device is configured as a microcomputer, and the on-chip ROM is 
enabled. The 'C5x devices fetch their reset vector at location 0 of program 
memory; so, if the device is operating as a microcomputer, it starts running 
from on-chip ROM. Otherwise, it starts running from off-chip memory. Once 
the program is running, you can change the MP/MC configuration by setting 
or clearing the MP/MC bit in the PMST register. Note that the MP/MT pin is 
sampled only at reset. The following instruction removes the ROM from pro- 
gram space: 


OPL#8,PMST ;Remove boot ROM from program space. 


You can submit code to be masked for the 'C51’s 8K-word or for the ’'C53’s 
16K-word on-chip ROM. This is a process-masked ROM cell, which requires 
ROM codes to be submitted to Texas Instruments for implementation in the de- 
vice, as detailed in Appendix H. 


At reset, the single-access RAM and the 512-word program/data (BO) RAM 
are not resident in program space. You can make the single-access RAM resi- 
dent in program space by setting the RAM bit in the PMST register to 1. When 
the RAM bit is set, these RAM cells become addressable in program space. 
You can make the dual-access RAM block BO resident in program space 
(OFEOOh-OFFFFh) by setting the CNF bit to 1. The following code example 
maps these blocks into program space. 


Program Memory et : 


OPL #010h,PMST ;Map ‘C5x single-access memory 
sin program space. 
SETC CNF ;Map BO to program space. 
Table 6—1 through Table 6-3 show program memory configurations available 
on the 'C5x devices. Note that all addresses are specified in hexadecimal. 


Table 6—1.’C50 Program Memory Configuration Control 


LCNF | RAM | MPmc | ROM | SARAM | DARAMBO | Off-Chip_— 
ee ee ae 
po 0000-FFFF 
poo Tt | 000-07FF | osoo-2BFF_ | | RCOOFFFF 


0800-2BFF | Q000-O07FF 
eee 
pt t000-o7FF | FEOO-FFFF | 0800-FDFF 


Oe ee FEOO-FFFF | _0000-FDFF 
pot tt 000-07FF | 0800-2BFF | FEOO-FFFF | _2C00-FOFF 
Table 6—2.'C51 Program Memory Configuration Control 


Le ee 
| 2COO-FDFF | 

Tenr[ RAM | MPIC [ROM | SARAM | DARAMEO | On-chip 

a SS Oa 


et ee See 
EO OB 


2000-23FF 000—-1FFF 
| 2400-FFFF FFF 


SO 
pos EOF FFF | 0000-FDFF 
ott tT 000-1 FFF | 2000-2aFF | FEOO-FFFF | 2400-FDFF 


2000-23FF FEOO-FFFF 0000-1 FFF 
| 2400-FDFF 
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Table 6-3. ’C53 Program Memory Configuration Control 


Tent [Raw [wee [Row [Saran [| oarameo | oncnp | 
a 
a A OS 
a a 

s000-48FF 
ee ee ee 
oo oa [Fo 2000-FO 
orto [0000-5 
000 


0000-3FFF 4000—-4BFF FEOO-FFFF 4000-FDFF_ | 


4000-4BFF FEOO-FFFF 0000-1 FFF 
2400-FDFF 


6.2.2 Program Memory Address Map 


The reset, interrupt, and trap vectors are addressed in program space. These 
vectors are soft—meaning that the processor, when taking the trap, loads the 
PC with the trap address and executes code at the vector location. Two words 
are reserved at each vector location for a branch instruction to the appropriate 
interrupt service routine. Table 6—4 shows the interrupt vector addresses after 
reset. 


Table 6-4. ’C5x Interrupt Vector Addresses 
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At reset, these vectors are mapped absolutely to address Oh in program 
space. However, the vectors can be remapped to the beginning of any 2K- 
word page in program space after reset. This is done by loading the interrupt 
vector pointer (IPTR) bits in the PMST register with the appropriate 2K-word 
page boundary address. After loading IPTR, any user interrupt or trap vector 
is mapped to the new 2K-word page. For example: 


OPL #05800h, PMST ;Remap vectors to start at 5800h. 


This example moves the interrupt vectors to off-chip program space at ad- 
dress 05800h. Any subsequent interrupt (except for a reset) will fetch its inter- 
rupt vector from that new location. For example, if, after loading the IPTR, an 
INT2 occurs, the interrupt service routine vector will be fetched from location 
5804h in program space as opposed to location 04h. This feature facilitates 
moving the desired vectors out of the boot ROM and then removing the ROM 
from the memory map. Once the system code is booted into the system from 
the boot-loader code resident in ROM, the application reloads the IPTR with 
a value pointing to the new vectors. In the above example, the OPL instruction 
is used to modify the PMST. This example assumes that the IPTR is currently 
set to Os. If it is not,then it must be set to Os before this instruction is executed; 
this assures that the correct value for IPTR is set. 


6.2.3 Program Memory Addressing 


The program memory space contains the code for applications. It can also 
hold table information and immediate operands. The program memory is ac- 
cessed only by the PAB address bus. The address for this bus is generated 
by the program counter (PC) when instructions and long immediate operands 
are accessed. The PAB address bus can also be loaded with long immediate, 
low accumulator, or registered addresses for block transfers, multiply/accu- 
mulates, and table read/writes. 


The ’C5x devices fetch instructions by putting the PC on the PAB bus and read- 
ing the appropriate location in memory. While the read is executing, the PC is 
incremented for the next fetch. If there is a program address discontinuity (for 
example, branch, call, return, interrupt, or block repeat), the appropriate ad- 
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Program Memory 


dress is loaded into the PC. The PC is also loaded when operands are fetched 
from program memory. Operands are fetched from program memory when the 
device reads or writes to tables (TBLR and TBLW), when it transfers data to/ 
from data space (BLPD and BLDP), or when it uses the program bus to fetch 
a second multiplicand (MAC, MACD, MADS, and MADD). The PC is loaded 
with a value other than PC + 1 in the following ways: 


Long immediate address with branch or call instructions. 

Long immediate address with MAC, MACD, BLDP or BLPD instructions. 
Low accumulator with BACC or CALA instructions. 

Low accumulator with TBLR or TBLW instruction. 

BMAR with MADS, MADD, BLDP or BLPD instructions. 

CALU with an interrupt vector address (INTR, TRAP, or NMI) instruction. 
CALU with PASR when at the end of a block repeat loop. 

Top of stack popped with a return instruction. 


OUOOUOUUU 


The address flow of a program can be traced externally through the address 
visibility feature. This feature can be used to debug during program develop- 
ment; it is enabled after reset and disabled/re-enabled by setting/clearing the 
AVIS bit in the PMST register. The address visibility mode sends the program 
address out to the address pins of the device, even when on-chip program 
memory is addressed. Note that the memory control signals (PS, RD, etc.) are 
not active in address visibility mode. 


Instruction addresses can be externally clocked with the falling edge of the in- 
struction acquisition (IAQ) pin (see Appendix A for IAQ timings). These instruc- 
tion addresses include both words of a two-word instruction but do not include 
block transfers, table reads, or multiply/accumulate operands. The address 
visibility mode also allows a specific interrupt trap to be decoded in conjunction 
with the interrupt acknowledge (IACK) pin. While TACK is low, address pins 
A1—A4 can be decoded to identify which interrupt is being acknowledged (see 
Appendix A for [ACK timings). Once the system is debugged, the address visi- 
bility mode can be disabled by setting the AVIS bit to one. Disabling the ad- 
dress visibility mode lowers the power consumption of the device and the RF 
noise of the system. Note that if the processor is running while HOLDA is active 
low (HM = 0), the address is not visible at the pins, regardless of the address 
visibility mode. 


6.2.4 Program Memory Security Feature 


The on-chip program memory can be secured on the 'C5x devices. This secu- 
rity feature does not allow an instruction fetched from off-chip memory to read 
or write on-chip program memory. The pipeline controller tracks instructions 
fetched from off-chip memory, and, if the operand address resides in on-chip 
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program space, the instruction reads invalid data off the bus. The limitations 
of the mode are as follows: 


Instructions fetched from off-chip memory cannot read or write on-chip 
single-access and read-only memory. 

Instructions fetched from BO cannot read or write on-chip single-access 
and read-only program memory. 

Coefficients for off-chip multiply/accumulate instructions cannot reside in 
on-chip single-access and read-only program memory. 

The on-chip single-access memory cannot be mapped to data space. 
The emulator cannot work with on-chip program memory. 

The program memory address range that corresponds to the on-chip 
single-access RAM is not available for external memory. 


OOO oO oO O 


This feature can be used with the on-chip ROM to secure program code that 
is stored in external memory. The ROM code can include a decryption algo- 
rithm that takes encrypted off-chip code, decrypts it, and stores the routine in 
on-chip single-access program RAM. This is a process-mask option and, like 
the ROM, must be submitted to Texas Instruments for implementation. 


6.2.5 External interfacing to Program Memory 


The ’C5x devices can address up to 64K words of program memory off-chip. 
These are key signals for external memory interfacing: 


AO-A15 = 16-Bit Bidirectional Address Bus 
DO-D15 16-Bit Bidirectional Data Bus 


PS Program Memory Select 

STRB External Memory Access Active Strobe 

RD Read Select (External Device Output Enable) 
WE Write Enable 


TACK Interrupt Acknowledge 

READY Memory Ready to Complete Cycle 

HOLD ~—_— Request for Control of Memory Interface 

HOLDA Acknowledge HOLD Request 

BR Bus Request 

TAQ Acknowledge Bus Request (when HOLDA is low) 


An example of a minimal external program memory interface is shown in 
Figure 6—4. In this figure, the 'C5x device interfaces to an 8K x 8 EPROM. 
The use of 8-bit-wide memories saves power, board space, and cost over 
16-bit wide memory banks. The 16-bit-wide memory banks can be used with 
the same basic interface as the 8-bit-wide memories. Note that the 'C5x cannot 
directly execute code from 8-bit-wide memory. An on-chip program (such as 
a bootloader) is required to read 8-bit-wide memory to form 16-bit long instruc- 
tion words and transfer them to on-chip RAM. 
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Figure 6-4. Interface to External EPROM 


‘C5x 8K x 8 EPROM 
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The program select (PS) signal is connected directly to the chip select (CS) 
to select the EPROM on any external program access. The EPROM is ad- 
dressed in any 8K address block in program space. If multiple blocks of 
memory are to be interfaced in program space, a decode circuit that gates PS 
and the appropriate address bits can be used to drive the memory block chip 
selects. 


The RD signal is tied directly to the output enable (OE) pin of the EPROM. The 
OE signal enables the output drivers of the EPROM. The drivers are turned 
off in time to guarantee that no data bus conflicts occur with an external write 
by the ’C5x devices. 


The device can be interfaced to external program RAM by connecting the WE 
signal to the write enable signal of the RAM device. The ’C5x devices take two 
cycles on all external writes, including a half cycle before the WE goes low and 
a half cycle after WE goes high; this prevents buffer conflicts on the external 
buses. Additional write cycles can be obtained by modifying the software wait- 
state generator registers. Subsection 6.3.4 includes an example of interfacing 
to external RAM. 
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6.3 Local Data Memory 


The local data memory space on the 'C5x addresses up to 64K of 16-bit words. 
The 'C50, ’C51, and ’C53 have 9K, 1K and 3K words of on-chip single-access 
RAM (SARAM), respectively. All 'C5x devices have the same 1056 words of 
dual-access RAM (DARAM). These on-chip memory cells can be configured 
by software in or out of the local data address map. When these cells are 
mapped into data space, the device automatically accesses them when ad- 
dressing within their bounds. When an address is generated outside these 
bounds, the device automatically generates an external access. The advan- 
tages of operating from on-chip memory are as follows: 


1) Higher performance because no wait states are required. 

2) Higher performance because of better flow within the pipeline of the 
CALU. 

3) Lower cost than external memory. 

4) Lower power than external memory. 


The advantage of operating from off-chip memory is the ability to access a larg- 
er address space. 


6.3.1 Local Data Space Configurability 


The local data memory can reside both on and off chip. At reset, the configura- 
tion maps the 1056 words of dual-access RAM into local data space. Block BO 
can be reconfigured into program space by setting the CNF bitin ST1 to 1. The 
single-access RAM can be mapped into data space by setting the OVLY bit to 
1 in the PMST register. Table 6—5 the possible local data memory configura- 
tions available on the 'C50. Table 6-6 and Table 6—7 show the possible local 
data memory configurations available on the’C51 and’C53, respectively. Note 
that all locations in the address range, Oh—800h, that are not mapped into 
on-chip memory are on-chip reserved locations (BOh-FFh and 500h—7FFh). 
Addresses 0—4Fh contain on-chip memory-mapped registers, and addresses 
50—5Fh contain the memory-mapped 1/O ports. 


Table 6—5. ’C50 Local Data Memory Configuration Control 


[oviy [| DARAMB0 | DARAME1 | DARAMG2 | SARAM | OW-Ghip 
[0 | to0r-2FFn | so0n-aern | 6on-7en |__| _800n-FFFFn 
of = | sonar | eoren | ——*d;«00n FFP 
a 


| | _300h—4FFh 60h-7Fh 800h-2BFFh | 2COOh-FFFFh 


Go 
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Table 6-6. ’C51 Local Data Memory Configuration Control 


[cnr [vty | DARAMBO | DARAMB! | DARAME2 | SARAM | onchip 
[0 [0 | oon-2Fen | aoon-aFFn | eon-7en |__| eoon-FrFrh 
[7 [1 | oon-aren | soon-aren | eon-7en | 600n-BFFh | _Coor-FFFFn 
[+0 atonmarrn | eon-7en | ———*|—_e00ncFFF 

YF soonaren | eon-7en | 600m BFP | cooh-FrFFn 


Table 6—7.’C53 Local Data Memory Configuration Control 


[ cnr_[ oviy | parameo | parame! | DARAMe2 | SARAM | Off-Chip | 
po | 0 t00n-2FFnh_ | 300h—4FFh_ | Goh-7Fh_ || BOOH-FFFFH 
po | 1 t00n-2FFn | 300h—4FFh_ | Goh_7Fh__ | 800h—-13FFh__ | 1400h-FFFFM 
po = 800h—4FFh | Gon-7Fh_ | BOO -FFFFH 
pot tT 800h—aF Fh | Gon-7Fh__ | 800h-13FFh | 1400h-FFFFH 


6.3.2 Local Data Memory Address Map 


The 64K words of local data memory space include the memory-mapped reg- 
isters for the device. The memory-mapped registers reside in data page 0. 
Data page 0 has five sections of register banks: core CPU registers, peripheral 
registers, test/emulation reserved area, |/O space port hole, and scratch-pad 
RAM. 


[_} The 28 core CPU registers can be accessed with zero wait states. Some 
of these registers can be accessed through paths other than the data bus 
— for example, auxiliary registers can be loaded by the auxiliary register 
arithmetic unit (ARAU) by using the LAR instruction. 

[J The peripheral registers are the control and data registers used in the pe- 
ripheral circuits. These registers reside on a dedicated peripheral bus 
structure called the TIBUS. They require one wait state when accessed. 

[-.} The test/emulation reserved area is used by the test and emulation sys- 
tems for special information transfers. Writing to this area can cause the 
device to change its operational mode and, therefore, affect the oper- 
ation of the application. 

[.} The I/O space port hole provides addressability to 16 words of I/O space 
within the data address space. This allows access to I/O space (other than 
IN and OUT instructions) via the more extensive addressing modes avail- 
able within the data space. For example, the SAMM instruction can write 
to an I/O memory-mapped port as an OUT instruction does. The external 
interface looks as if an OUT instruction occurs (IS active). Port addresses 
reside off-chip and are subject to external wait states. They are also af- 
fected by the on-chip software wait-state generator, like any other nonme- 
mory-mapped |/O port. 

[.} The scratch-pad RAM block (B2) includes 32 words of dual-access RAM 
for variable storage without fragmenting the larger RAM blocks, both on 
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the device and external to the device. Table 6-8 shows the address map 
of data page 0. 


Table 6—8. Data Page 0 Address Map 
Name Description 


Core Processor Memory-Mapped Registers 
Reserved 
Interrupt Mask Register 


Global Memory Allocation Register 


a 
DD 


Interrupt Flag Register 


Er 


PMST 
RPTC 
BRCR 
PASR 
PAER 
TREGO 
TREG1 
TREG2 
DBMR 
R 


u 


rocessor Mode Status Register 

epeat Counter Register 

lock Repeat Counter Register 

lock Repeat Program Address Start Register 


> 


lock Repeat Program Address End Register 
Temporary Register Used for Multiplicand 
Temporary Register Used for Dynamic Shift Count (5 bits only) 
Temporary Register Used as Bit Pointer In Dynamic Bit Test (4 bits only) 
ynamic Bit Manipulation Register 


0 


> 
° 


Auxiliary Register Zero 


R1 Auxiliary Register One 
AR2 Auxiliary Register Two 
Auxiliary Register Three 
Auxiliary Register Four 
ARS5 Auxiliary Register Five 
AR6 Auxiliary Register Six 


R 
INDX 
ARCR 
CBSRI1 


N 


_ 
N 


Auxiliary Register Seven 

Index Register 

Auxiliary Register Compare Register 
Circular Buffer 1 Start Register 
CBER1 27 Circular Buffer 1 End Register 


| cBsR2 | 2 | 1C Circular Buffer 2 Start Register 
[oper2_| 29 | 10 | GrcularButer2endRegister 
[oscr «| —80~—«|~==StE —*«|;<rcular Buffer ContolRegistor 
[eman | st | 1F | Block MoveAddress Register 

Peripheral Memory-Mapped Registers 

| DAR | 92 | 20 | DataRecoweRegisior SS SCSC~“‘~*~C~*S 
[oxr «| 33—=«|=St «| «Cata TranemitRegistor SS SSCSCSC~C~C~* 
SPC [34 | 22 | SerialPortGontrolRegister SCS 
35 | 23 | Reserved 


ah, 


A 


ond, 


o 
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Table 6-8. Data Page 0 Address Map (Continued) 


Peripheral Memory-Mapped Registers (Continued) 


a 
= 
‘ 
N 


Timer Register 
Period Register 
Timer Control Register 


| 39 | 27 |__—=Reserved 
a Program/Data S/W Wait-State Register 
jowse [a1 | 8 


I/O Port S/W Wait-State Register 


CWSR ee ae Control S/W Wait-State Register 
a | Reserved for Test/Emulation 
| TROV =| 48s 80s TDM Data Receive Register 
TDM Data Transmit Register 


TDM Serial Port Control Register 
TDM Channel Select Register 


Receive/Transmit Address Register 


: 
; 


+ 
a 
rs 


Received Address Register 
Resened 
_ Memory-Mapped I/O Ports 
I/O Port 80 | 
| VOPots81 

1/O Port 82 

I/O Port 83 

/O Port 84 

I/O Port 85 

I/O Port 86 
| /O Port 87 
| | 0 Port 88 

I/O Port 89 

/O Port 90 

| VO Port 91 
| /O Port 92 

/O Port 93 

/O Port 94 
| 95 | 1/0 Port 95 
| 96-127 Scratch Pad RAM 


6.3.2.1 Auxiliary Register (ARO-AR7) 


The eight 16-bit auxiliary registers (ARO—AR7) can be accessed by the CALU 
and modified by the ARAU or the PLU. The primary function of the auxiliary 


PAO 
PAL 
PA2 
PAS 
PA4 
PA5 
PAG 
PA7 
PAS 
PAQ 


PAIO 
PAT 
PA12 
PA13 
PA14 
PA15- 


5 
5 
_ 


57 
A 
C 
D 

5F 


so 
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registers is generating 16-bit addresses to data space. However, these regis- 
ters can also act as general-purpose registers or counters. Subsection 6.3.3 
describes how these registers are used in indirect addressing. 


6.3.2.2 Auxiliary Register Compare Register (ARCR) 


The auxiliary register compare register (ARCR) is a 16-bit register for address 
boundary comparison. The ARCR is compared to the selected AR by the 
CMPR instruction, and the result of the compare is placed in the TC bit of ST1. 
Subsection 6.3.3 describes how the ARCR can be used in memory manage- 
ment. 


6.3.2.3 Index Register (INDX) 


The index register (INDX) is used by the ARAU as a step value for indirect ad- 
dressing modifications to auxiliary registers (i.e., addition or subtraction by 
more than 1). For example, when the ARAU steps across a row of a matrix, 
the indirect address is incremented by 1. However, when the ARAU steps 
down a column, the address is incremented by the dimension of the matrix. 
The ARAU can add or subtract the value stored in INDX from AR(ARP) as part 
of the indirect address operation. The INDX register is also used to map the 
dimension of the address block used for bit-reversal addressing. Subsection 
6.3.3 describes how INDX can be used in memory management. 


6.3.2.4 Circular Buffer Registers (CBSR1, CBER1, CBSR2, CBER2, CBCR) 


The ’C5x devices support two concurrent circular buffers operating in conjunc- 
tion with user-specified auxiliary registers. Two circular buffer start registers 
(CBSR1 and CBSR2) indicate the 16-bit address where the circular buffer 
starts. Two circular buffer end registers (CBER1 and CBER2) indicate the end 
of the circular buffers. The circular buffer control register (CBCR) controls the 
operation of these circular buffers. Subsection 6.3.3 describes how circular 
buffers can be used in memory management. 


6.3.2.5 Block Move Address Register (BMAR) 


The 16-bit block move address register (BMAR) holds an address value for 
use with block moves and multiple/accumulate operations. This register pro- 
vides 16-bit address to a second indirect-addressed operand for these opera- 
tions. The use of the BMAR is described further in subsection 6.3.3. 


6.3.2.6 Repeat Registers (RPTC, BRCR, PASR, and PAER) 


The repeat counter (RPTC) holds the repeat count in a repeat single-instruc- 
tion operation. This register is loaded by the RPT and RPTZ instructions. 
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The block repeat counter register (BRCR) holds the count value for the block 
repeat feature. This value is loaded before a block repeat operation is initiated. 
It can be changed while a block repeat is in progress; however, take caution 
in this case to avoid infinite loops. The program address start register (PASR) 
holds the start address of the block of code to be repeated. The program ad- 
dress end register (PAER) holds the end address of the block of code to be 
repeated. Both these registers are loaded by the RPTB instruction. Block re- 
peats are described in more detail in subsection 3.6.5. 


6.3.2.7 Interrupt Registers (IMR,IFR) 


The interrupt mask register (IMR) is used to individually mask off specific inter- 
rupts at required times. The interrupt flag register (IFR) indicates the current 
status of the interrupts. Interrupts are described in detail in Section 3.8. 


6.3.2.8 Global Memory Allocation Register (GREG) 


The global memory allocation register (GREG) is used to allocate parts of the 
data address space as global memory. This register defines what amount of 
the local data space will be overlayed by global data space. The operation of 
GREG is further discussed in Section 6.4. 


6.3.2.9 Dynamic Bit Manipulation Register (DBMR) 


The dynamic bit manipulation register (DBMR) is used in conjunction with the 
PLU to provide a dynamic (execution time programmable) mask register. The 
use of this register is described in Section 3.7. 


6.3.2.10 Temporary Registers (TREGO, TREG1, TREG2) 


TREGO holds one of the multiplicands of the multiplier. It can also be loaded 
via the CALU with the following instructions: LT, LTA, LTD, LTP, LTS, SQRA, 
SQRS, MAC, MACD, MADS, and MADD. TREG1 holds a dynamic (execution- 
time programmable) shift count for the prescaling shifter. TREG2 holds a dy- 
namic bit address for the BITT instruction. 


6.3.2.11 Processor Mode Status Register (PMST) 


The processor mode status register (PMST) controls memory configurations 
of the ’C5x devices (with exception of the CNF bit in ST1). The PMST register 
is described in more detail in subsection 3.6.3 and in the configurability sec- 
tions of Chapter 6. 
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6.3.2.12 Serial Port Registers (DRR, DXR, SPC) 


Three registers control and operate the serial port. The serial port control reg- 
ister (SPC) contains the mode control and status bits of the serial port. The 
data receive register (ORR) holds the incoming serial data, and the data trans- 
mit register (DXR) holds the outgoing serial data. The serial port is described 
in more detail in Section 5.4. 


6.3.2.13 TDM Serial Port Registers (TRCV, TDXR, TSPC, TCSR, TRTA, TRAD) 


The TDM serial port is a feature superset of the first serial port. The TDM serial 
port supports applications that require serial communication in a multiproces- 
sing environment. The TDM serial port is described in more detail in Section 
5.4. 


6.3.2.14 Timer Registers (TIM, PRD, TCR) 


The timer operates with three registers. The TIM register is the current count 
of the timer. The PRD register defines the period for the timer. The TCR (timer 
control register) controls the operations of the timer. Refer to Section 5.6 for 
more details on the timer. 


6.3.2.15 Software Wait-State Registers (PDWSR, IOWSR, CWSR) 


The software wait-state registers contain the wait-state counts for the different 
banks of off-chip memory address ranges. PDWSR contains the wait-state 
count for the four 16K blocks of program and data memory. IOWSR contains 
the wait-state counts for the 16 partitions of |/O space. The CWSR control reg- 
ister determines the range of wait states you may select—(0, 1, 2, or 3) or (0, 
1, 3, 7). In addition, the BIG bit in the CWSR register determines how the I/O 
space is partitioned. If BIG is set to 0, the I/O wait states apply to the pair of 
port addresses. If the BIG bit is set to 1, the I/O wait states apply to 8K blocks 
of the I/O space. Refer to Section 5.3 for more details on software wait states. 


6.3.2.16 I/O Space Port Hole (PAO—15) 


The I/O space port hole allows the addressing of sixteen locations (50h—5Fh) 
of I/O space via the addressing modes of the local data space. This means that 
these locations can be read directly into the CALU or written from the ACC. It 
also means that these locations can be acted upon by the PLU or addressed 
via the memory-mapped addressing mode. The locations can also be ad- 
dressed with the IN and OUT instructions. 


6.3.2.17 Scratch Pad RAM 


This 32-word block of RAM can be used to hold overhead variables so that the 
larger blocks of RAM are not fragmented. This RAM block supports dual-ac- 
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cess operations and can be addressed by using the memory-mapped ad- 
dressing mode or any data memory addressing mode. 


6.3.3 Local Data Memory Addressing 


The local data space address generation is controlled by the decode of the cur- 
rent instruction. Local data memory is read via data address bus 1 (DAB1) on 
instructions with only one data memory operand and program address bus 
(PAB) on instructions with a second data memory operand. An instruction op- 
erand is provided to the CALU in eight ways, as described in subsection 3.4.2. 
However, data memory addresses are generated in one of the following five 
ways: 


(.) By the direct address bus (DAB) using the direct addressing mode (for ex- 
ample, ADD 010h) relative to the data page pointer (DP), 

[_.} By the direct address bus (DAB) using the memory-mapped addressing 
mode (for example, LAMM PMST) within data page zero, 

[-} By the auxiliary register file bus (AFB) using the indirect addressing mode 
(for example, ADD *), 

[-.]} By the value pointed at by the PC in long immediate address mode (for 
example, BLDD TBL1,*+), and | 

(-.} Bythe block memory address register (BMAR) in registered block memory 
addressing mode (for example, BLDD, BMAR*+). 


In the direct addressing mode, the 9-bit data memory page pointer (DP) points 
to one of 512 pages (1 page=128 words). The data memory address (dma), 
specified by the seven LSBs of the instruction, points to the desired word within 
the page. The address on the DAB is formed by concatenating the 9-bit DP 
with the 7-bit dma. 


Figure 6—5 illustrates the direct addressing mode. In the illustration, the oper- 
and is fetched from data memory space via the data bus, and the address is 
the concatenated value of the DP and the seven LSBs of the instruction. For 
the following example, consider DP = 018Dh and TEMP1 = 010h: 


LACC TEMP1 sACC = TEMP1. 


In the example, the accumulator is loaded with DATA(CE80). 


Local Data Memory nnn 


Figure 6—5. Direct Addressing Mode 


010h 


0010 0000 |oj001 o000 


Machine Code = 


DP = 11100 1110 1 / / 
\ \ / / 


DAB = 11100 %1110 3131001 0000 


Operand = Data(DAB) 


Note: DAB is the 16-bit internal address bus for data memory. 


The memory-mapped addressing mode operates much like the direct ad- 
dressing mode except that the most significant 9 bits of the address are forced 
to zero instead of being loaded with the contents of the DP. This makes it possi- 
ble to address the memory-mapped registers of data page zero directly with- 
out the overhead of changing the DP or auxiliary register. 


Figure 6-6 illustrates memory-mapped addressing mode. For the following 
example, consider DP = 0184h and TEMP1 = O8060h: 


LAMM 07h ;ACC = PMST 


In this example, the contents of memory location 7h is loaded into the accumu- 
lator. 


Figure 6-6. Memory-Mapped Addressing Mode 
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LAMM PMST 


MachineCode = 10000 1000 {0/000 0111 


/ / 
/ / 
Value = |0000 0000 Of / / 
/ / 
| / / 
DAB = {0000 0000 Oj000 0111 
Operand = Data(DAB) 


In the indirect addressing mode, the currently selected 16-bit auxiliary register 
AR(ARP) addresses the data memory through the AFB. While the selected 
auxiliary register provides the data memory address and the data is being ma- 
nipulated by the CALU, the contents of the auxiliary register can be manipu- 


Memory 


lated through the ARAU. See Figure 6—7 for an example of indirect auxiliary 
register addressing. In this case, ARS is the selected auxiliary register 


(ARP=3). 
Figure 6—7. Indirect Addressing Mode 
ADD * 
MachineCode = = 


ARP =(0 1 1]-PAR3 =[1100 1110 1001 0000 
Operand = Data(AR(ARP)) 


The following code illustrates the use of indirect addressing in a program: 


* This routine uses indirect addressing to calculate the following equation: 
* 
* 10 
* 
* \ X(I) x Y¥(T) 
* / 
* 
* I=] 
* 
* The routine assumes that the X values are located in on-chip RAM block BO, 
* and the ¥ values in block Bl. The efficiency of the routine is due to the 
* use of indirect addressing and the repeat instruction. 
* 
SERIESMAR *,AR4 sARP POINTS TO ADDRESS REGISTER 4. 

SETC CNF CONFIGURE BLOCK BO AS PROGRAM MEMORY. 

LAR AR4,#0300h ;POINT AT BEGINNING OF DATA MEMORY. 

RPTZ #9 *>CLEAR ACC AND P; REPEAT NEXT INST. 10 TIMES 

MAC OFFOO0h,*+ *MULTIPLY AND ACCUMULATE; INCREMENT AR4. 

APAC *> ACCUMULATE LAST PRODUCT. 
RET sAccumulator contains result. 


In the long immediate addressing mode, an operand is addressed by the sec- 
ond word of a two-word instruction. In this case, the program address/data bus 
(PAB) is used for the operand fetch. The prefetch counter (PFC) is pushed onto 
the microcall stack (MCS), and the long immediate value is loaded into the 
PFC. The PAB is then used for the operand fetch or write. At the completion 
of the instruction, the MCS is popped back to the PFC. The PC is incremented 
by two, and execution continues. This technique is used when two memory ad- 
dresses are required for the execution of the instruction. The PFC is used so 
that when the instruction is repeated, the address generated can be autoin- 
cremented. Figure 6-8 illustrates this mode. In this illustration, the source ad- 
dress (OPERAND1) is fetched via PAB, and the destination address (OPER- 
AND2) uses the direct addressing mode. 
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Figure 6-8. Long Immediate Addressing Mode 


BLDD 02345h,012h 


Machine Code1 = 1010 1000 |0j0 0 1 0010 
3 g y 
/ / 
bP =[7 i100 ti10 i) / / 
\ \ / / 
\ \X 7 / 
DAB = [1100 11710 1/001 0010 
Operand2 = Data (DAB) 
Machine Code2 = 0010 0011 0100 0101 
PC = 0010 0011 0100 0101 
Operand1 = Data (PC) 


The registered block memory addressing mode operates like the long immedi- 
ate addressing mode with the exception that the address comes from the 
BMAR register. The advantage of this technique over long immediate address- 
ing is that it allows the address of the block of memory to be changed in run- 
time. On the other hand, the address in long immediate addressing mode re- 
sides in the program flow and cannot be easily changed. Figure 6—9 shows an 
example of registered block memory addressing mode. 


Figure 6—9. Registered Block Memory Addressing Mode 
BLDD BMAR, 012h 


Machine Code1 = 1010 #110 0/0/0011 0010 


/ / 
/ / 
DP = / 
/ / 
\ pane 4 / 
DAB = [1100 1110 1001 0010 
Operand2 = Data (DAB) 
BMAR] —>| PFC _ 
Operand1 = Data (PFC) 


6-22 Memory 


_ Local Data Memory 


‘C5x devices provide a register file containing eight auxiliary registers 
(ARO—AR7). The auxiliary registers can be used for indirect addressing of the 
data memory or for temporary data storage. Indirect auxiliary register address- 
ing (see Figure 6—10) allows placement of the data memory address of an in- 
struction operand into one of the auxiliary registers. These registers are 
pointed to by a three-bit auxiliary register pointer (ARP) that is loaded with a 
value from 0 through 7, designating ARO through AR7, respectively. 


Figure 6-10. Indirect Auxiliary Register Addressing Example 


Auxillary Register File Data Memory Map 
| 0000h 
” AR1 | 51 50 hj 
Auxiliary 
Register Pointer 
AR2 E9FCh | 
(in STO) | agai 
ARP |0{1/1|—® AR3 [| FF 3 A hi—> meee i 
ana [103 Bh FFFFh | 
AR5 2681 4h 
AR6 000 8h 
AR7 8 43 Dh 


The auxiliary registers and the ARP can be updated directly from data memory, 
the accumulator, or the product register, or by an immediate operand defined 
in the instruction. The contents of these registers can also be stored in data 
memory or used as inputs to the CALU. These registers appear in the memory 
map as described in Table 6-8. 


The auxiliary register file (ARO—-AR7) is connected to the auxiliary register 
arithmetic unit (ARAU), shown in Figure 6-11. The ARAU can autoindex the 
current auxiliary register while the data memory location is being addressed. 
Indexing either by + 1 or by the contents of the INDX register can be performed. 
As a result, accessing tables of information does not require the central arith- 
metic logic unit (CALU) for address manipulation. The CALU can perform oth- 
er operations in parallel. 


If more advanced address manipulation is required, such as multidimensional 
array addressing, the CALU can directly read from or write to the auxiliary reg- 
isters. However, the ARAU updates of the ARs is done during the decode 
phase (second cycle) of the pipeline, whereas the CALU writes during the ex- 
ecution phase (fourth cycle) of the pipeline. Therefore, the two instructions di- 
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rectly following the CALU write to an auxiliary register should not use the same 
auxiliary register for address generation. 


Figure 6-11. Auxiliary Register File 
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As shown in Figure 6—11, the index register, the compare register, or the eight 
LSBs of the instruction register can be connected to one of the inputs of the 
ARAU. The other input is fed by the current AR (being pointed to by ARP). 
AR(ARP) refers to the contents of the current AR pointed to by ARP. The ARAU 
performs the functions shown in Figure 6—12. 
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Figure 6-12. ARAU Functions 


AR(ARP) + INDX —> AR(ARP) Index the current AR by adding a 16-bit un- 
signed integer contained in INDX. Exam- 
ple: ADD *0O+. 

AR(ARP) — INDX — AR(ARP) Index the current AR by subtracting a 16-bit 
unsigned integer contained in INDX. Ex- 
ample: ADD *0-. 

AR(ARP) + 1 — AR(ARP) Increment the current AR by one. Example: 
ADD *+. 

AR(ARP) — 1 — AR(ARP) Decrement the current AR by one. Exam- 
ple: ADD *-. 

AR(ARP) — AR(ARP) Do not modify the current AR. Example: 
ADD *. 

AR(ARP) + IR(7—0) — AR(ARP) Add an 8-bit immediate value to current 
AR. Example: ADRK #055h. 

AR(ARP) — IR(7—0) —- AR(ARP) Subtract an 8-bit immediate value from cur- 
rent AR. Example: SBRK #055h. 


AR(ARP) + rc(INDX) — AR(ARP) Bit-reverse indexing; add INDX with re- 


verse-carry (rc) propagation. Example: 
AR(ARP) — rc(INDX) — AR(ARP) 


ADD *BRO+. 


Bit-reverse indexing; subtract INDX with 
reverse-carry (rc) propagation. Example: 
ADD *BRO-. 


Compare current AR_ with ARCR and if 
condition is true, then set TC bit of the 

status register (ST1) to one. If false, then 
clear TC. Example: CMPR 3. 


If at end of circular buffer, reload start 
address. 


If (AR(ARP) == ARCR), then TC = 1 
if (AR(ARP) < ARCR), then TC = 1 
If((AR(ARP) > ARCR), then TC = 1 
If(AR(ARP) » ARCR), then TC = 1 


AR(ARP) =CBSR 
The index register (INDX) can be added to or subtracted from AR(ARP) on any 
AR update cycle. This 16-bit register is one of the memory-mapped registers 
and is used to increment or decrement the address in steps larger than one 
for operations such as addressing down a column of a matrix. The auxiliary 
register compare register (ARCR) is used as a limit to blocks of data and, in 
conjunction with the CMPR instruction, supports logical comparisons between 
AR(ARP) and ARCR. The ’C2x devices use ARO for these two functions. Upon 
reset, a LAR load of ARO also loads INDX and ARCR to maintain compatibility 
with the 'C2x devices. To avoid loading the INDX and ARCR registers on an 
ARO load, the NDX bit of the PMST register is set to one. For the following ex- 
ample, assume INDX = 010h, ARP = 3, and ARS = 0200h: 


ADD*0+,4,AR5 ;ACC += addressed value shifted left 4. 


In the example, DATA(200) is shifted left 4 bits and added to the ACC, AR3 is 
incremented by 10h, and ARP is changed to 5. 
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The ’'C5x supports two circular buffers operating at a given time. These two 
circular buffers are controlled via the circular buffer control register (CBCR). 
The CBCR is defined in Table 6-9. 


Table 6-9. Circular Buffer Control Register 
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| 0-2 | CAR1 | Identifies which auxiliary register is mapped to circular buff- | 
er 1 


3 | CENB1 | Circular buffer 1 enable=1/disable=0. Set to 0 upon reset 


4-6 | CAR2 Identifies which auxiliary register is mapped to circular buff- 
er2 


CENB2 | Circular buffer 2 enable=1/disable=0. Set to 0 upon reset 


Upon reset (R& rising edge), both circular buffers are disabled. To define a cir- 
cular buffer, load the CBSR1/2 with the start address of the buffer and 
CBER1/2 with the end address. Load the auxiliary register to be used with the 
buffer with an address between the start and the end, load CBCR with the ap- 
propriate auxiliary register number, and set the enable bit. As the address is 
stepping through the circular buffer, the update is compared against the value 
contained in CBER1/2. When those values are equal and any AR modification 
occurs, the value contained in CBSR1/2 is automatically loaded into the AR. 
For the following example, assume CBSR1 = 0200h, CBER1 = 0203h, CBCR 
= OCh, AR4 = 0203h, and ARP = 4: 


ADD*+ ;ACC += addressed value at 203h. 


At the completion of the instruction, AR4 = 0200h. 


Circular buffers can be used with either increment- or decrement-type up- 
dates. If increment updates are used, then the value in CBER must be greater 
than the value in CBSR. If decrement updates are used, the value in CBER 
must be less than the value in CBSR. The other indirect addressing modes 
may also be used; however, the ARAU tests only for the condition 
AR(ARP)=CBER. The ARAU will not wrap around if an AR update steps over 
the value contained in CBER. Note that the test in the ARAU is performed be- 
fore the auxiliary register update. Refer to subsection 4.1.6 for details. 
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6.3.4 External Interfacing to Local Data Memory 


The 'C5x devices can address up to 64K words of off-chip local data memory. 
These are the key signals for this interface: 


AOQ-A15 
DO—D15 


16-Bit Bidirectional Address Bus 

16-Bit Bidirectional Data Bus 

Data Memory Select 

External Memory Access Active Strobe 

Read Select (External Device Output Enable) 
Write Enable 

Memory Ready to Complete Cycle 

Request for Control of Memory Interface 
Acknowledge HOLD Request 

Bus Request 

Acknowledge Bus Request (when HOLDA is low) 


An example of an external RAM interface is shown in Figure 6—13. In this fig- 
ure, the ’C5x device interfaces to four 16K x 4-bit RAM devices. The data 
memory select (DS) is directly connected to the chip select (CS) of the devices. 
This means the external RAM block will be addressed in any of the four 16K 
banks of local data space. If there are additional banks of off-chip data 
memory, a decode circuit that gates DS with the appropriate address bits can 
be used to drive the memory block chip select. 
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Figure 6-13. Interface to External RAM 
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The RD signal is tied directly to the output enable (OE) pin of the RAMs. This 
signal enables the output drivers of the RAM and turns them off in time to pre- 
vent data bus conflicts with an external write by the ’'C5x device. If the RAM 
device does not have an OE pin, then DS should be gated with STRB and con- 
nected to the CS pin of the RAM to implement the same function. The WE sig- 
nal of the ’C5x is tied to the WE signal of the RAM. The ’C5x takes at least two 
cycles on all external writes, including a half cycle before the WE goes low and 
a half cycle after WE goes high; this prevents buffer conflicts on the external 
buses. Additional wait states can be generated with the software wait-state 
generators. 
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6.4 Global Memory 


For multiprocessing applications, the 'C5x devices are capable of allocating 
global data memory space and communicating with that space via the BR (bus 
request) and READY control signals. In addition, this capability can be used 
to extend the data memory address map by overlaying the address space. 


Global memory is memory shared by more than one processor. Therefore, ac- 
cess to it must be arbitrated. When global memory is used, the processor's ad- 
dress space is divided into local and global sections. The local section is used 
by the processor to perform its individual function, and the global section is 
used to communicate with other processors. This implementation facilitates 
shared data multiprocessing in which data is transferred between two or more 
processors. Unlike a direct memory access (DMA) between two processors, 
reading or writing global memory does not require that one of the processors 
be halted. 


6.4.1. Global Memory Configurability 


A memory-mapped global memory allocation register (GREG) specifies part 
of the ’C5x data memory as global external memory. The register, GREG, 
memory-mapped to data memory address location 5h, is an eight-bit register 
connected to the eight LSBs of the internal data bus. The upper eight bits of 
location 5 are nonexistent and are read as ones. 


The contents of GREG determine the size of the global memory space be- 
tween 256 and 32K words. The legal values of GREG and corresponding glob- 
al memory spaces are shown in Table 6—10. Note that values other than those 
listed in the table lead to fragmented memory maps and should be avoided. 


Table 6-10. Global Data Memory Configurations 


GREG Value Local Memory Global Memory 


~ 000000Xx | Oh-OFFFFh 65,536 
10000000 Oh—07FFFh 32,768 ; 08000h—-OFFFFh 
11000000 Oh-OBFFFh 49,152 0C000h—OFFFFh 


11100000 | Oh—ODFFFh 57,344 OEQOOh—OFFFFh 
11110000 Oh-—OEFFFh 61,440 OFOO00Oh—OFFFFh 
11111000 Oh-OF7FFh 63,488 OF800h—OFFFFh 
11111100 | Oh—-OFBFFh 64,512 OFCOOh—OFFFFh 
11111110 Oh—OFDFFh 65,024 OFEOOh—OFFFFh 
11111111 Oh-—OFEFFh 65,280 OFFOOh—OFFFFh 
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6.4.2 Global Memory Addressing 


When a data memory address, either direct or indirect, corresponds to a global 
data memory address (as defined by GREG), BR is asserted low with DS to 
indicate that the processor wishes to make a global memory access. External 
logic then arbitrates for control of the global memory, asserting READY when 
the ’C5x device has control. The length of the memory cycle is controlled by 
the READY signal. In addition, the software wait-state generators can be used 
to extend the access times for slower, external memories. The wait-state gen- 
erators corresponding to the overlapped memory address space in local data 
space will generate the wait states for the corresponding addresses in global 
data memory space. 


6.4.3 External Interfacing of Global Memory 


Global memory can be used in various digital signal processing tasks, such 
as filters or modems, where the algorithm being implemented may be divided 
into sections with a distinct processor dedicated to each section. With multiple 
processors dedicated to distinct sections of the algorithm, throughput may be 
increased via pipelined execution. Figure 6—14 illustrates an example of a 
global memory interface. Since the processors can be synchronized by using 
the RS pin, the arbitration logic may be simplified and the address and data 
bus transfers made more efficient. 


Figure 6-14. Global Memory Interface 
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The global memory interface can also be used to extend the data memory ad- 
dress map beyond the reach of the 16-bit address bus by paging in an addition- 
al 32K words. Loading the GREG register with the appropriate value can over- 
lay the local data memory with additional memory, starting at the highest 
memory address (OFFFFh) and moving down. This additional memory is dif- 
ferentiated from local memory accesses by the BR pin being active low. The 
rest of the memory interface control signals (STRB, DS, etc.) behave identical- 
ly on a local or global data access. 
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6.5 Input/Output Space 


The ’C5x devices support an I/O address space of 64K 16-bit parallel input and 
output ports. I/O ports allow access to peripherals typically used in DSP appili- 
cations such as codecs, digital-to-analog (D/A) converters, and analog-to-digi- 
tal (A/D) converters. This section discusses addressing I/O ports and interfac- 
ing I/O ports to external devices. 


6.5.1. Addressing Input/Output Ports 


Access to external parallel I/O ports is multiplexed over the same address and 
data bus for program/data memory accesses. I/O space access is distin- 
guished from program/data memory accesses by the IS signal going active 
low. All 65,536 ports can be accessed via the IN and OUT instructions, as 
shown in the following example: 


IN DAT7,0OFFFEh;Read data to data memory from external 
;device on port 65534. 

OUT DAT7,0OFFFFh ;Write data from data memory to external 
s;device on port 65535. 


Sixteen of the 64K I/O ports are mapped in data memory space as shown in 
Table 6—4. The I/O ports may be accessed with the IN and OUT instructions 
along with any instruction that reads or writes a location in data space. In this 
way, I/O is treated the same way as memory. The following example illustrates 
the use of direct addressing to access an I/O device on port 51h: 


SACL 5lh :(DP = 0) Store accumulator to external 
;device on port 81. 


Accesses to memory-mapped I/O space are also distinguished from program/ 
data accesses by the IS signal. DS is not active, even though the user is writing 
to data space. 


6.5.2 Interfacing to I/O Ports 


The RD and WE signals can be used along with chip-select logic to output data 
to an external device. The port address can be decoded and used as a chip 
select for the input or output device. The access times to 1/O ports can be mo- 
dified through the CWSR and lIOWSR software wait-state registers. The BIG 
bit in the CWSR register determines how the I/O space is mapped to the soft- 
ware control registers. If the BIG bit is set to 0 in the CWSR register, the first 
sixteen ports are assigned in pairs to a software wait-state generator. Each fol- 
lowing set of 16 registers maps accordingly to the first 16 ports when BIG = 0. 
For example, the 16 ports that correspond to the addresses in the data space 
port hole (ports 50h—5Fh) have the same wait states as ports O-Fh. If the BIG 
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bit is set to 1, the wait states are mapped to program space in eight 8K blocks 
of memory. The following table shows how the software wait states are as- 
signed to I/O ports according to the BIG bit: 


I/O Ports When I/O Ports When 


IWSR Bits BIG=0 BIG=1 
0-1 Port 0/Port 1 Ports 0000h—1FFFh 
2-3 Port 2/Port 3 Ports 2000h—3FFFh 
4—5 Port 4/Port 5 Ports 4000h—5FFFh 
6—7 Port 6/Port 7 Ports 6000h—7FFFh 
8-9 Port 8/Port 9 Ports 8000h—9FFFh 
10-11 Port 10/Port 11 Ports AO00h—BFFFh 
12-13 Port 12/Port 13 Ports C000h—DFFFh 
14-15 Port 14/Port 15 Ports EOOOh—FFFFh 


See Section 5.3 for details. 
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6.6 Direct Memory Access (DMA) 


The ’C5x supports multiprocessing designs using direct memory access 
(DMA) of external memory or the 'C5x on-chip single access RAM. The DMA 
feature can be used for multiprocessing by temporarily halting the execution 
of one or more processors to allow another processor to read from or write to 
the 'C5x’s local off-chip memory or on-chip single-access RAM. You can con- 
trol the external memory access via the HOLD/HOLDA signals. The DMA ac- 
cess of internal RAM on the ’C5x is controlled by the HOLD, HOLDA, R/W, 
STRB, BR, and IAQ lines. 


The multiprocessing is typically a master-slave configuration. The master may 
initialize a slave by downloading a program into its program memory space 
and/or may provide the slave with the necessary data by using external 
memory to complete a task. In a typical 'C5x direct memory access scheme, 
the master may be a general-purpose CPU, another ’C5x, or even an ana- 
log-to-digital converter. A simple 'C5x master-slave configuration is shown in 
Figure 6—15. 


Figure 6—15. Direct Memory Access Using a Master-Slave Configuration 


"C5x 
(Master) 


MasterData Master Program | Slave Program Siave Data 
| Memory (RAM) Memory (ROM) Memory (RAM) Memory (RAM) 


The master ’C5x device takes complete control of the slave’s external memory 
by asserting HOLD low via its external flag (XF). This causes the slave to place 
its address, data , and control lines in a high-impedance state. 


After control of the slave’s buses is given up to the master processor, the slave 
alerts the master of the fact by asserting HOLDA. This signal may be tied to 
the master 'C5x BIO pin. The slave’s XF pin may be used to indicate to the 
master when it has finished performing its task and needs to be reprogrammed 
or requires additional data to continue processing. In a multiple-slave configu- 
ration, priority of each slave’s task may be determined by tying the slave’s XF 
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signals to the appropriate INT(4, 3, 2, or 1) pin on the master ’C5x device. The 
external bus interface of the slave 'C5x device is put in high-impedance mode 
when its HOLDA signal is asserted. While the HOLDA is active, the processor 
can continue running code out of its internal memory (internal ROM or single/ 
dual access RAM) if it is in concurrent hold mode (status bit HM is 0). However, 
TAQ pin does not indicate instruction acquisition, once HOLDA goes active. 
Otherwise, the processor will halt internal execution (status bit HM is 1). See 
Section 3.8 for interaction between HOLD R&, and external interrupts. 


A PC environment presents another example of a potential direct memory ac- 
cess scheme in which a system bus (the PC bus) is used for data transfer to 
external 'C5x memory. In this configuration, either the master CPU or a disk 
controller may place data onto the system bus, which can be downloaded into 
the local memory of the ’'C5x device. In this case, the 'C5x acts more like a pe- 
ripheral processor with multifunction capability. In aspeech application, for ex- 
ample, the master can load the ’C5x program memory with algorithms to per- 
form such tasks as speech analysis, synthesis, or recognition, and can fill the 
'C5x data memory with the required speech templates. In another application 
example, the 'C5x can serve as a dedicated graphics engine. Programs can 
be downloaded via the system bus into program RAM. Data can come from 
PC disk storage or can be provided directly by the master CPU. 


Figure 6—16 depicts a direct memory access using a PC environment. In this 
configuration, decode and arbitration logic are used to control the direct 
memory access. When the address on the system bus resides in the local 
memory of the peripheral 'C5x, this logic asserts the HOLD signal of the 'C5x 
while sending the master a not-ready indication to allow wait states. After the 
’‘C5x acknowledges the direct memory access by asserting HOLDA, READY 
is asserted and the information is transferred. 
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Figure 6-16. Direct Memory Access in a PC Environment 
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The ’C5x also provides direct access of the on-chip single-access RAM for ex- 
ternal devices. DMA of the on-chip single-access RAM requires the following 


Disk Controller 


Address 


Data 


signals: 

HOLD External request for control of address, data, and control lines. 

HOLDA Indicates to external circuitry that the memory address, data, 
and control lines are in high impedance, allowing external ac- 
cess of on-chip single-access RAM. 

BR Bus request signal. Externally driven low in hold mode to indi- 
cate a request for access to on-chip single-access RAM. 

TAQ Acknowledge BR request for access to on-chip single-access 
RAM while HOLDA is low. 

R/W Read/write signal indicates the data bus direction for DMA reads 
(high) and DMA writes (low). 

STRB When active low and IAQ and HOLDA are low, this input signal is 


used to select the memory access. STRB determines the dura- 
tion of the memory access. 

A(15-0) Address inputs during HOLDA and ER active low. 

D(i5-0) | DMA data. 


To access the 'C5x device’s on-chip single-access RAM, a master processor 
must control the device. The master processor initiates a DMA transfer by 
placing the 'C5x device in HOLD. Once the device responds with a HOLDA, 
the master can select access to the internal on-chip single-access RAM by 
lowering the BR input. The device responds with an IAQ to acknowledge ac- 
cess to the on-chip memory. Once access is granted, the master drives the 
R/W signal to indicate the direction of the transfer. On a DMA write, the master 
must drive the address and data lines for a write. On a DMA read, the master 
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must drive the address lines and latch the data. Each memory access (read 
or write) must be selected by the STRB signal. External access wait states are 
added by extending the STRB signal. The address decode of the DMA access 
includes only A13—-A0, (A14 and A15 ignored). The ranges shown in 
Table 6-11 respond during DMA access, effectively overlaying A13—A0. 


Table 6—11. Address Ranges for On-Chip Single-Access RAM DMA 


Address Bus Hex Address 
Range 


A13—A0 used 0000—23FF 
A15, A14 ignored 4000—63FF 
8000—A3FF 
CO00—E3FF 


‘AS-AO used 0000—03FF 


A13—-A10 must be 0 4000—43FF 
A15—A14 ignored 8000—83FF 
C000—C3FF 


A11—A0 used 0000—0BFF 
A13—A12 must be 0 4000—4BFF 
A15—A14 ignored 8000—8BFF 

C000—CBFF 


Note that the above address ranges correspond to 9K/1K/3K words of on-chip 
single-access RAM of the ’C50/51/53, respectively. For example, writing to the 
address 01h (using DMA) on a’C50 affects the second memory location of the 
on-chip single-access RAM. Furthermore, writing to the address 4001h on 
‘C50 is equivalent to writing to the address Oth, as shown in Table 6-11. 


6-36 Memory 


Memory Management 


6.7 Memory Management 


6.7.1 Block Moves 


The 'C5x devices have a programmable memory map, which can vary for each 
application. Instructions are provided for integrating the device memory into 
the system memory map. The ’C50 device includes 2K words of boot ROM, 
9K words of single-access RAM, and 1056 words of dual-access RAM. The 
‘C51 device includes 8K words of program ROM, 1K words of single-access 
RAM, and 1056 words of dual-access RAM, wheras the ’C53 has 16K words 
of on-chip ROM, 3K words of single-access and 1056 words of dual-access 
RAM. Examples of moving and configuring memory are provided in this sec- 
tion. 


The ’C5x devices address a large amount of memory but are limited in the 
amount of on-chip memory. Several instructions are available for moving 
blocks of data from off-chip slower memories to on-chip memory for faster pro- 
gram execution. In addition, data can be transferred from on-chip to off-chip 
for storage or multiprocessor applications. 


The BLDD instruction facilitates the transfer of data from external or internal 
data memory to internal or external data memory. Example 6—1 illustrates the 
use of the BLDD command to move data (for example, a table of coefficients) 
from external memory to internal data RAM. 


Example 6—1.Moving External Data to Internal Data Memory With BLDD 


+ £¢ + 


MOVED LACC #8000h 
SAMM BMAR 


This routine uses the BLDD instruction to move external data memory to 
internal data memory. 


;BMAR contains source address in data memory. 


LAR AR7,#300h ;AR7 contains destination address in data memory. 


MAR *,AR7 
RPT #511 


*LARP = AR7. 
;Move 512 values to data memory block Bl. 


BLDD BMAR, *+ 


For systems with external data memory but no external program memory, the 
BLDP instruction can be used to move additional blocks of code into internal 
program memory. Example 6-2 illustrates the use of the BLDP instruction. 


6-37 


Memory Management 


Example 6-2.Moving Data Memory to Program Memory With BLDP 


* 
* This routine uses the BLDP instruction to move external data memory to 
* internal program memory. This instruction could be used to boot load a 
* program to the on chip program RAM from external data memory. 
* 
MOVEDPLACC #2000h | 
SAMM BMAR *BMAR contains dest. address in program memory (’C51) 
LAR AR7,#0F000h ;AR7 contains source address in data memory 
MAR *,AR7 * ARP=AR7 
RPT #1023 :Move 1k of data to program memory space 
BLDP *+ 
RET 


When no external data memory is available, program memory may contain 
necessary coefficient tables that should be loaded into internal data memory. 
The routine in Example 6-3 illustrates the use of the BLPD instruction to per- 
form this function. 


Example 6—3.Moving Program Memory to Data Memory With BLPD 


* 
* This routine uses the BLPD instruction to move external program memory to 
* internal data memory. This routine is useful for loading a coefficient 
* table stored in external program memory to data memory when no external 
* data memory is available. 
* 
MOVEPDLAR AR7,#300h ;AR7 points to destination in data memory 

MAR *,AR7 ; ARP=AR7 

RPT #127 ;Move 128 values from external program to 

BLPD #0FD00h,*+ ;internal data memory. 

RET 


Another method of transferring data between memory spaces uses the TBLR 
and TBLW instructions. These instructions can specify a calculated, rather 
than predetermined, location of a block of data in program or data memory for 
transfer. The following examples illustrate the use of the TBLR and TBLW in- 
structions. 
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Example 6—4.Moving Program Memory to Data Memory With TBLR 


* 
* This routine uses instruction TBLR to move program memory to data memory 
* space. It differs from the BLPD instruction in that the accumulator 
* contains the address in program memory from which to transfer. This allows 
* for a calculated, rather than predetermined, location in program memory to 
* be specified. The calling routine must load accumulator with the source 
* address. 
* 
TABLERMAR *,AR3 ; ARP=AR3 
LAR AR3,#300h ;AR3 contains destination in data memory 
RPT #127 sMove 128 items to data memory block B2 
TBLR *+ sAccumulator contains external program 
RET ;memory address. 


Example 6—5.Moving Data Memory to Program Memory With TBLW 


* 


* This routine uses the TBLW instruction to move data memory to 
* program memory. The calling routine must contain the destination program 


* memory address in the accumulator. 
* 


TABLEWMAR *,AR4 ;LARP = AR4. 
LAR AR4,#300h ;AR4 contains source address in data memory 
RPT #511 *Move 512 items from data memory to program 
TBLW *+ ;mMemory. 
RET ;Accumulator contains address of program RAM. 


The IN and OUT instructions move data from data memory to an external port. 
The use of these instructions is shown in Example 6-6 and Example 6—7. 


Example 6—6.Moving Data From I/O Space to Data Memory With SMMR 


* 
* This routine uses the SMMR instruction to move data from a memory—mapped 
* I/O port to local data memory. Note that 16 I/O ports are mapped in data 
* page 0 of the ‘C5x memory map. 
* 
INPUT: 

LDP #0 

RPT #511 ;Input 512 values from port 5lh to 

SMMR 51h,800h stable at 800h in data memory. 

RET 


Example 6—7.Moving Data From Data Memory to I/O Space With LMMR 


* 
* This routine uses the LMMR instruction to move data from local data 
* space to a memory—mapped I/O port. Note that 16 I/O ports are mapped 
* in data page 0 of 'C5x memory map. 
* 
OUTPUT: 

LDP #0 sdata page 0 

RPT #63 ;Output 64 values from a table at 800h 

LMMR 50h, 800h sin data memory to port 50h. 

RET 
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6.7.2 Boot Loader (’C50) 


The main function of the boot loader is to transfer user code from an external 
source to the program memory at power-up. The ’C50 provides different ways 
to download the code to accommodate various system requirements. For 
some applications, a serial interface is appropriate. For others, a parallel inter- 
face is appropriate if the code is already stored in external ROM. 


Ifthe MP/MC pin of the ’C50 is sampled low during a hardware reset, execution 
begins at location zero of the on-chip ROM. The on-chip ROM is factory pro- 
grammed with a boot-load program. 


The boot-load program sets up the CPU status registers before initiating the 
boot load. Interrupts are globally disabled (INTM=0), internal dual-access 
RAM block BO is mapped in program space (CNF=1), and the on-chip single- 
access RAM block is enabled in program space (RAM=1, OVLY=0). Seven 
wait states are selected for the entire program and data spaces. Initially, the 
32K words of global data memory are enabled in data space 08000h to 
OFFFFh. After the code transfer is complete, the global memory is disabled be- 
fore control is transferred to the destination address. 


The boot routine reads the global data memory location OFFFFh by driving the 
bus request (BR) and data strobe (DS) pins low. The lower eight bits of the 
word read from global memory location OFFFFh specify the mode of transfer. 
The rest of the bits are ignored by the boot loader. 


Figure 6—17 lists available boot options and corresponding configuration byte 
patterns. 


Figure 6-17. Boot Routine Selection Word 
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15 8 7 43 0 At Address FFFFh 


_ xxoooxx_ 0K 0000_| 8 -Bit Serial Mode 
16-Bit Serial Mode 


[3000000K [200K] 1700_] 16 Bit Paratiet 70 Mode 
| XXXXXXXX SRC 01 | 8-Bit Parallel EPROM Mode 


LXXXXXXXX_ | SRC 10_ | 16-Bit Parallel EPROM Mode 
ce | Warm Boot 


Note: X = Don'tcare condition 
SRC = 6-bit page address for parallel modes 


ADDR = _ 6-bit page address for warm boot 


Memory 
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Parallel Boot 


The parallel boot option is used if the code is stored in EPROMs (8-bit or 16-bit 
wide in global data space). The code is transferred from global data memory 
to program memory. The six MSBs of the source address are specified by the 
SRC field of the boot routine selection (BRS) word as shown in Figure 6—17. 
A 16-bit EPROM address is defined by this SRC field as shown in Figure 6-18. 


Figure 6-18. 16-Bit EPROM Address 


15 10 9 


ee 


Source Address 


If the16-bit parallel mode is selected, data is read in 16-bit words from the 
source address, incrementing the address by one after every read operation. 
The destination address destination;g and the length /ength;, of the code are 
specified by the first two 16-bit words. The length Nis defined as: 


length N = number of 16-bit words to be transferred — 1 


The number of 16-bit words specified by the parameter N do not include the 
first two words read, starting from the source address — that is, the destination 
and length parameters. This is shown in Figure 6—19. The code is transferred 
from the global data memory to the program memory. There is at least a four- 
instruction cycle delay between aread from EPROM and awrite to the destina- 
tion address. This ensures that if the destination is external memory (such as 
fast SRAM), there is enough time to turn off the source memory (EPROM) be- 
fore the write operation is performed. 


Figure 6-19. 16-Bit Parallel Boot 


16-Bit Data Bus 


Code Word(1)46 | 
| Code Word(N)16 | 


Destination,, 16-bit destination address. 
Lengthie 16-bit word that specifies the length of the code (N) that follows it. 
Code Word(N);g5 —_N 16-bit words to be transferred. 


After the specified length of code words are transferred to the program 
memory, the ’C50 branches to the destination address. | 
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If the 8-bit parallel boot option is selected, two consecutive memory locations 
(starting at source address) are read to make one 16-bit word. The high-order 
byte should be followed by the low-order byte. Data is read from the lower eight 
data lines, ignoring the upper byte on the data bus. The destination address 
is a 16-bit word that constitutes address in program space where the boot code 
is transferred. The length N is defined as: 


length N = number of 16-bit words to be transferred — 1 
length N = (number of bytes to be transferred -- 2) — 1 


The number of 16-bit words specified by the parameter N do not include the 
first four bytes (or first two words) read, starting from the source address —that 
is, the destination and length parameters. This is shown in Figure 6-20. The 
code is transferred from the global data memory to the program memory. There 
is at least a four-instruction cycle delay between a read from source memory 
(such as EPROM) and a write to the destination address. This ensures that if 
the destination is external memory (such as fast SRAM), there is enough time 
to turn off the source memory (EPROM) before the write operation is per- 
formed. 


Figure 6—20. 8-Bit Parallel Boot 


7 Lower 8 data lines 0 


Code Word(N 


Code Word(N), 


Destination, 
Destination, 


Length, 
Length 


High and low bytes of destination address 


High and low bytes of a 16-bit word that specifies the length 
N of the code that follows it 


Code Word(1),, N high and N low bytes that constitute N words to be trans- 
Code Word(1); ferred 

Code Word(N)p 

Code Word(N), 
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Serial Boot 


V/O Boot 


In the serial boot option, the serial port control register (SPC) is set to OxxF8h 
or OxxFCh for 16-bit and 8-bit modes, respectively. The RRST and XRST bits 
are each set to 1, taking the serial port out of reset. FSM is set to 1, configuring 
the serial port in frame sync mode — that is, frame sync pulses are required 
to be supplied externally on the FSR pin. The value of the FO bit is set accord- 
ing to the mode selected (8- or 16-bit modes). The external flag XF signals that 
the 'C50 is ready to respond to the serial port receive section. XF is set to high 
at reset and is driven low to initiate reception. No frame sync pulses should 
appear on the FSR before XF going low. The receive clock must be supplied 
by a device external to the ’C50. 


In the case of 16-bit serial mode, the first 16-bit word received by the device 
from the serial port specifies the destination address of boot code in program 
memory. The next 16-bit word specifies the length of the actual code that fol- 
lows it. The length N is defined as: 


length N = number of 16-bit words — 1 


Note that the number of 16-bit words specified by the parameter N do not in- 
clude the first two words read, starting from the source address — that is, the 
destination and length parameters. In the case of an 8-bit serial transfer, a 
higher-order byte followed by a low-order byte constitute a 16-bit word. The 
first 16-bit word received by the device from the serial port specifies the des- 
tination address of boot code in program space. The following 16-bit word 
specifies the length of the actual code that follows it. The length N is defined 
as: 


length N = number of 16-bit words — 1 
length N = (number of bytes to be transferred —- 2) — 1 


After the specified length of code words is transferred to program memory, the 
‘C50 branches to the destination address. 


The I/O boot mode is provided to asynchronously transfer code from I/O port 
50h to internal/external program memory. Each word may be either 16 bits or 
8 bits long. The C50 communicates with the external device by using BIO and 
XF handshake lines. The handshake protocol shown in Figure 6—21 is re- 
quired to succesfully transfer each word from port 50h: 
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Host Request 
Data Xmit 


Warm Boot 


’C50 Ready | Host Data Deas 
to Receive 


Valid Acknowledges 
| Data Received 


One Word Transfer Next Word Transfer 


If the 8-bit transfer mode is selected, the lower eight data lines are read from 
port 50h. The upper byte on the data bus is ignored. The ’'C50 reads two 8-bit 
words to form a 16-bit word. The low byte of a 16-bit word should follow the 
high byte. 


For both 8-bit and 16-bit |/O modes, the first two 16-bit words received by the 
‘C50 must be the destination and the length of the code, respectively. See the 
parallel boot description for destination and length code words. A minimum 
delay of four clock cycles is provided between the XF rising edge and the write 
operation to the destination address. This allows the host processor sufficient 
time to turn off its data buffers before the 'C50 initiates write operation (if des- 
tination is external memory). Note that the ‘C50 accesses the external bus 
only when XF is high. 


The warm boot option can be used if the program has already been transferred 
to internal (or external) memory by other means (for example, DMA), or if it is 
only a warm device reset. In this case, six MSBs of the 8-bit long BRS word 
specify the entry point of the code as shown in Figure 6—22. 


Figure 6—22. Warm Boot 


15 10 0 
ADDR poo} o] a] of of of of of o 
Entry Address 


The ’C50 transfers control to the entry address if a warm boot is specified. 
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The’C5x digital signal processors maintain source code compatibility with 'C1x 
and ’C2x generations and have architectural enhancements that improve per- 
formance and versatility. An orthogonal instruction set is augmented by new 
instructions that support additional hardware and handle data movement and 
memory-mapped registers. Other features include an independent parallel 
logic unit (PLU) for performing Boolean operations, a 32-bit accumulator buff- 
er, and a set of registers that provide zero-latency context-switching capabili- 
ties to interrupt service routines. The on-chip dual-access RAM and 
memory-mapped register set are enhanced. 


This chapter explains the use of ’C5x instruction set with particular emphasis 
on its new features and special applications. For a complete discussion of the 
assembler directives used in this chapter’s examples, please consult the 
TMS320 Fixed-Point DSP Assembly Language Tools User's Guide, literature 
number SPRUO18B. Major topics discussed in this chapter are listed below. 


Topic Page 
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Processor Initialization 


Before executing a digital signal processing algorithm, it is necessary to initial- 
ize the processor. Generally, initialization takes place anytime the processor 
is reset. 


The processor is reset by applying a low level to RS input; the IPTR bits of 
PMST register are all cleared, thus mapping the vectors to page zero in pro- 
gram memory space. This means that the reset vector always resides at pro- 
gram memory location 0. This location normally contains a branch instruction 
to direct program execution to the system initialization routine. A hardware re- 
set clears all pending interrupt flags and sets the INTM (global enable inter- 
rupts) bit to 1, thereby disabling all interrupts. It also initializes various status 
bits and peripheral registers. Refer to subsection 3.8.1 for details. 


To configure the processor after the reset, the following internal functions 
should be initialized. 


Memory-mapped core processor and peripheral control registers 
Interrupt structure (INTM) 

Mode control (OVM, SXM, PM, AVIS, NDX, TRM) 

Memory control (RAM, OVLY, CNF) 

Auxiliary registers and the auxiliary register pointer (ARP) 

Data memory page pointer (DP) 


OUUOUOUO 


The OVM (overflow mode), TC (test/control flag), IMR (interrupt mask regis- 
ter), auxiliary register pointer (ARP), auxiliary register pointer buffer (ARB), 
and data memory page pointer (DP) are not initialized by reset. 


Example 7—1 shows coding for initializing the 'C5x to the following machine 
state, and for the initialization performed during hardware reset: 


Internal single-access RAM configured as program memory 
Interrupt vector table loaded in internal program memory 
Interrupt vector table pointer (IPTR) 

Internal dual-access RAM blocks filled with zero 

Interrupts enabled 


ODOUUUO 


Software Applications 


Example 7-1. Initialization of ’'C5x 


etitle ‘PROCESSOR INITIALIZATION’ 
-mmregs 

ref ISRO, ISR1,ISR2,ISR3,ISR4, TIME 
ref RCV, XMT, TRCV, TXMT, TRP, NMISR 


MAIN PRG set 04000h sprogram space address of main 


foreground routine 


e 
o 
e 
g 


MITTEE TTT TET T ETOP TET EOETOTEEEOEFOREOEOEEEPEFEOTEOPEFETEOET ORES 

* Processor initialization for TMS320C50. 

* 

* For the TMS320C51, the memory mapping of S/A RAM in program 

* space and data space is not identical. Therefore, memory location 

* pointed to by address 0800h in data space is mapped to address 

* 02000h in program space. Hence, the vector table must be loaded 

* at data memory 0800h in order to keep the vector table address 

* 02000h in program space. 

MOTT TTT OTTO ET TOT ETE E EOE EEOEETOTEOPEFOE EOF EIP ETOP ETT OEEEEOTEEEEE 

V_TBL -sect “vectors” 

RESET B_ INIT :This section will be loaded in program 
;memory address Oh. 

INT1 B  ISR1 ;INT1— begins processing here 

INT2 B ISR2 ;INT2— begins processing here 

INT3 B  ISR3 :INT3— begins processing here 

TINT B TIME :Timer interrupt processing 

RINT B RCV *Serial port receive interrupt 

XINT B XMT :Serial port transmit interrupt 

TRNT B TRCV *TDM port receive interrupt 

TXNT B TXMT 7TDM port transmit interrupt 

INT4 B ISR4 ;INT4— begins processing here 


-Space 14*16 714 words 


TRAP B  TRP 


NMI B NMISR 
- text 
INIT LDP #0 ;Initialize data pointer 
OPL #20h,PMST ;Configure S/A RAM in data memory 
LAR AR7 ,#02000h ;data space address for vector table 
e333 LAR AR7 ,#0800h :for TMS320C51 
MAR *, AR7 sARP <— AR7 
RPT #39 sfor I=0,1<=39,I++ 
BLPD #V_TBL, *+ sLoad vector table at 2000h 
SPLK #201Eh,PMST sNow configure S/A RAM in program space 
sand initialize vector table pointer 
SPLK #01FFh,IMR ;Clear interrupt mask register 
CLRC OVM :Disable overflow saturation mode 
LAR AR7 ,#60h sInitialize B2 block 
RPTZ #31 sfor I=0,I<=31,I++ 
SACL *+ *B2[I] = 0 
LAR AR7 ,#100h sInitialize BO and Bl blocks 
RPTZ #1023 efor I=0,I<=1023,I++ 
SACL  *+ :BO/B1[I] = 0 
CLRC INTM :Globally enable interrupts 
B MAIN PRG sReturn to foreground program 
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interrupts 


The ’C5x devices have four external maskable user interrupts (INTT—INT4) 
and one nonmaskable interrupt (NMI) available for external devices. Internal 
interrupts are generated by the serial ports, the timer, and by the software in- 
terrupt instructions (INTR, TRAP, and NMI). The interrupt structure is de- 
scribed in subsection 5.1.2, Interrupts. 


The ’C5x devices are capable of generating software interrupts using INTR in- 
struction. This allows any of the 32 interrupt service routines to be executed 
from your software. The first 20 ISRs are reserved for external interrupts, pe- 
ripheral interrupts, and future implementations. The other 12 locations in the 
interrupt vector table are user-definable. The INTR instruction can invoke any 
of the 32 interrupts available on the ’'C5x devices. 


The context saving and restoring function is done in hardware when an inter- 
rupt trap is executed. An 8-deep hardware stack is available for saving return 
addresses of the subroutines and the interrupt service routines. Also, there is 
a one-deep stack (or shadow registers) for each of the following registers: 


ACC accumulator 

ACCB accumulator buffer 

PREG product register 

STO status register 0 (INTM not restored) 
ST1 status register 1 (XF not restored) 
PMST processor mode status register 
TREGO temporary register for multiplier 
TREG1 temporary register for shift count 
TREG2 temporary register for bit test 
INDX indirect address index register 
ARCR auxiliary register compare register 


When the interrupt trap is taken, all these registers are pushed onto the one- 
deep stack. These shadow registers are popped when the return-from-inter- 
rupt (RET! or RETE) is executed. Detailed discussion of interrupts are given 
in Section 3.8, Interrupts. 


Example 7-2 illustrates the use of INTR instruction. The foreground program 
sets up auxiliary registers and invokes user-defined interrupt number 20. 
Since the context is saved automatically, the interrupt service routine is free 
to use any of the saved registers without destroying the calling program's vari- 
ables. The routine shown here uses the CRGT instruction to find the maximum 
value of 16 executions of the equation Y=aX2+bX+c. The X values are pointed 
at by AR1. AR2 and AR3 point to the coefficients and Y results, respectively. 
To return the result to the calling routine, all the registers are restored by ex- 
ecuting an RETI instruction. The computed value is placed in the accumulator, 
and a standard return is executed because the stack is already popped. 
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Example 7-2. Use of INTR Instruction 


* Foreground Program 


-mmregs 
TEMP -set 63h ;Temporary storage. 

LAR AR1 ,#X ;AR1 points to X values 

LAR AR2,#COEFF ;AR2 points to coefficients b,a,c in that order 

LAR AR3 , #Y ;AR3 points to Y results 

INTR 20 sInvoke software interrupt #20 


* 

* This routine uses the block repeat feature of the ‘C50 to find the maximum 

* value of 16 executions of the equation Y=aX*2+bX+c. The X values are pointed 
* at by AR1. The Y results are pointed at by AR3. The coefficients are pointed 
* at by AR2. At the completion of the routine, ACC contains the maximum value. 
* AR1, AR2, and AR3 are modified. All other registers are unaffected. Note that 
* this routine should not be called from within a repeat block. 

* 
I 


SR20 LDP #0 ;Use page 0 of data memory. 
LACC #08000h 
SACB sInitialize AccB with min. possible value 
MAR *, AR1 sARP <— AR1 
* 
* Load Block repeat count register with 15. 
* SPLK #0Fh,BRCR 
* 
* Repeat Block. 
* 
RPTB END LOOP-—1 :For i=0; i<=15; i++. 
ZAP sACC = PREG = X“*2 
SQRA *+,AR2 sTREGO = X PREG = X“*2 
SPL TEMP ;Save X*2. 
MPY *+ ;PREG = b*X 
LTA TEMP sTREG = X“*2 ACC = b*X 
MPY *+ sPREG = a*X%*2 
APAC sACC = a*X*2 + b*X 
ADD *,0,AR3 sACC = A*X*2 + b*¥X +c 
SACL *+,0,AR1 ;Save Y. 
CRGT sSave maximum Y. 
END_LOOP 
SACL TEMP :Save the result temporarily 
LACC #RE ENTER ; 
PUSH ;Push re-entry address onto stack 
RETI :Pop all registers 
RE_ENTER LAMM TEMP ;Load ACC with the max. value 
RET ;Return to interrupted code 
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7.3 Software Stack 


The ’C5x has an internal 8-deep hardware stack that is used to save and re- 
store return addresses for subroutines and interrupts. See subsection 3.6.1 for 
further details. Provisions have been made on the 'C5x to extend the hardware 
stack into the data memory. 


The PUSH and POP instructions can access the hardware stack via the accu- 
mulator. Two additional instructions, PSHD and POPD, are included in the in- 
struction set so that the stack may be directly stored to and recovered from the 
data memory. 


A software stack can be implemented by using POPD instruction at the begin- 
ning of each subroutine to save the PC in data memory. Then, before returning, 
a PSHD is used to put the proper value back onto the top of the stack. 


When the stack has seven values stored on it, and two or more values are to 
be put on the stack before any other values are popped off, a subroutine that 
expands the stack is needed, such as the one shown in Example 7-3. In this 
example, the main program stores the stack, starting location in memory in 
AR2 and indicates to the subroutine whether to push the data from memory 
onto the stack or pop data from the stack to memory. If a zero is loaded into 
the accumulator before calling the subroutine, the subroutine pushes data 
from memory to the stack. If the accumulator contains a nonzero value, the 
subroutine pops data from the stack to memory. 


Because the CALL instruction uses the stack to save the program counter, the 
subroutine pops this value into the accumulator and utilizes the BACC instruc- 
tion to return to the main program. This prevents the program counter from be- 
ing stored into a memory location. The subroutine in Example 7—3 uses the 
BCNDD (delayed conditional branch) instruction to determine whether a save 
or restore operation is to be performed. 


Example 7—3.Software Stack Operation 
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* 

* This routine expands the stack while letting the 
* main program determine where to store the stack 
* contents, or from where to restore them. 

* Entry Conditions: 

* ACC = 0 (restore stack); 1 (save stack) 

* AR2 —> Top of software stack in data memory 

* 
S 


TACK: BCNDD POP,NEQ ;Delayed branch if POPD required 
MAR *,AR2 ;Use AR2 as stack pointer 
POP Get return address 
RPT #6 srepeat 7 times 
PSHD *+ *Put memory in stack 
BACC ;Return to main program 
POP: MAR *— sAlign AR2 
RPT #6 ;Repeat 7 times 
POPD *— ;Put stack in memory 
MAR w+ ;Realign stack pointer 
BACC sReturn to main program 
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7.4 Logical and Arithmetic Operations 


7.4.1 Parallel Logic Unit (PLU) 


The PLU provides direct logical path to data memory values without affecting 
the contents of the accumulator or product register. It allows direct manipula- 
tion of bits in any location in data memory space. Source operand can be either 
a long immediate value or the dynamic bit manipulation register (DBMR). The 
use of a long immediate value is particularly effective in initializing data 
memory locations, including the memory-mapped registers. The use of DBMR 
as source operand allows run-time computation of operands. It also reduces 
instruction execution time to one cycle, which may be important for time-critical 
routines. 


Example 7—4 and Example 7-5 illustrate the use of PLU for initialization and 
logical operation. The UNPACK subroutine extracts individual bits from a 
single word and stores them separately in an array. The PACK subroutine does 
the opposite of UNPACK by getting each bit from a different location and pack- 
ing them in a single word. In Example 7—5, notice that a NOP instruction is in- 
serted in the repeat-block loop to make it three words long. A repeat-block 
must be at least three words long on ’C5x devices. 


Example 7—4.Using PLU to Do Unpacking 


-title ‘Routine to extract bits from a single word’ 
* PCKD 
* 
* [Bn BO | 
* 
* 
* UNPCKD 
* 
* 10 — 0 {Bn 
* 
* |0 — O/]Bn-1| 
* 
* 
* 
* |0 —  0{Bo| 
* 
mmregs 
NO_BITS .set 16 ;number of packed bits in the word 
PCKD -set 60h ;Input word 
UNPCKD .set 61h ;Output buffer. Each word will have 
sone bit in LSB location. 
etext 
UNPACK LDP #0 ;DP=0 
MAR * , ARO 
LAR ARO, #UNPCKD+NO_BITS—1 ;End of table address 
SPLK #NO_BITS—1,BRCR sInitialize the count register 
SPLK #1,DBMR ;Load mask in DBMR register 
LACC PCKD >Packed bits —> Acc 
RPTB LOOP—1 ;Begin looping 
SACL * ;Save remaining packed bits 
APL + ;Keep the LSB only 
SFR ;Shift right to eliminate unpacked bit 
LOOP RET :Return back 
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Example 7-5.Using PLU to Do Packing 


1.4.2 


-title ‘Routine to pack input bits in a single word’ 
* 
*  PCKD 
* 
*  |Bn BO | 
* 
* 
* UNPCKD 
* 
* |0 — O |[Bn| 
¥e 
* |0 — 0O|Bn-1| 
* 
* 
* 
* |0 — 0/BO| 
* 
* 
-data 
NO_BITS .set 16 sNumber of bits to be packed 
PCKD eset 60h »Packed word 
UNPCKD .set 61h ;Array of unpacked bits 
text 
PACK LAR ARO , #UNPCKD ;ARO points to start of UNPACKED array 
MAR * , ARO sARP <— ARO 
LDP #0 ;DP=0 
SPLK #NO_BITS—2,BRCR ;Loop NO_BITS—1 times 
LACC w+ sGet the MSB 
RPTB LOOP—1 ;Begin looping 
SFL ;Make space for next bit 
ADD w+ sPut next bit 
NOP 
LOOP 
SACL PCKD ;Store the result 
RET sReturn back 


Multiconditional Branch Instruction 


The ’C5x allows multiple conditions to be tested before passing control to 
another section of program. Any of the following 13 conditions can be tested 
individually or in combination with others by CC, RETC, XC, and BCND in- 
structions: 


ACC=0 EQ 
ACC#0 NEQ 
ACC<0 LT 
ACCs0 LEQ 
ACC>0 GT 
ACC20 GEQ 
C=0 NC 
C=1 C 
OV=0 NOV 
OV=1 OV 
BIO low BIO 
TC=0 NTC 
TC=1 TC 
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Testing the status of TC flag is mutually exclusive to testing the BIO pin. The 
code in Example 7-6 tests the carry flag and the sign bit of the accumulator 
simultaneously to locate a zero bit (beginning from MSB) in a 64-bit word con- 
sisting of ACC and ACCB with ACC having the higher part. This 64-bit word 
could be the serial port output where the first zero indicates the start bit. 


Example 7-6.Using Multiple Conditions With BOND 


LDP #0 
SPLK #63, BRCR sno. of iterations — l 
;code to get 64-bit input word and load 
sit in ACC and ACCB 
LAR ARO , #0 sinitialize the bit counter 
RPTB ENDLOOP-—1 sfor I=0,1<=63,I++ 
SFLB sshift left ACC+ACCB, MSB is shifted 
* ;out in Carry flag 
MAR *+ sincrement bit counter 
BCND ENDLOOP,NC,LT ;exit if carry=0 and current MSB=1 
ENDLOOP: ;ACC+ACCB contains aligned data now 
APL #0f£ffeh,PMST ;clear BRAF flag 


7.4.3 Search Algorithm Using CRGT 


The following example shows how the CRGT and RPTB instructions find the 
maximum value and its location by searching through a block of data. Loop 
overhead is minimized by using the block-repeat function. The accumulator is 
initialized with the minimum possible value (O8000h) before the main search 
loop is entered. 


To find the minimum value, CRGT instruction may be replaced by CRLT, and 
the accumulator is loaded with the maximum possible value (O7FFFh) instead 
of the smallest. The rest of the code remains the same. 


Example 7—7.Using CRGT and CRLT 


* 
* This routine searches through a block of data in the data memory 
* to store the maximum value and the address of that value in memory 
* locations MAXVAL and MAXADR, respectively. The data block could be 
* of any size defined by the Block Repeat Counter Register (BRCR). 
* 
* KEY C5X instructions: 
* 
* RPTB Repeat a block of code as defined by repeat counter BRCR 
* CRGT Compare ACC to ACCB. Store larger value in both ACC, ACCB. 
* Set CARRY bit if value larger than previously larger is found 
* XC Execute conditionally (1 or 2 words) if flag (Carry) is set. 
* 
* 
MAXADR .set 60h 
MAXVAL .set 61h 
-mmregs 
- text 
LDP #0 :point to data page 0 
LAR ARO, #0300h ;AR= data memory addr 
SETC SXM ;set sign extension mode 
LACC #08000h s:load minimum value 
* Use #07FFFh (largest possible) to check for minimum value 
SACB sinto ACCB 
SPLK #9,BRCR srpt cont = 9 for 10 data values 
RPTB endb —1 ;repeat block from here to endb—-1l 
startb: 
LACC * s;load data from <(ARO)> into ACC 
CRGT ;set carry if ACC > previous largest 
* Use CRLT to find minimum value 
SACL MAXVAL ;save new largest which is in ACC & ACCB 
XC #1,C ;save addr if current value > previous largest 
SAR ARO, MAXADR 
MAR e+ 
endb: RET 


* At the end of routine, following 
* registers contain: 


* ACC 

ACCB 
(MAXVAL ) 
(MAXADR) 


-data 
-word 
eword 
eword 
eword 
eword 
-word 
-word 
-word 
eword 
-word 
-end 


+ + + 


= 32050 
32050 
32050 
0307h 


;data is expected to be in data RAM 

5000 sstart address = 0300h 
10000 

320 

3200 

—5600 

—2105 

2100 

32050 

1000 

—1 


7.4.4 Matrix Multiplication Using Nested Loops 
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The ’C5x provides three different types of instructions to implement code 
loops. The RPT (single-instruction repeat) instruction allows the following in- 
struction to be executed N times. The RPTB (repeat block) instruction repeat- 
edly executes a block of instructions with the loop count determined by the 
BRCR count register. The BANZ (branch if AR not zero) instruction is another 
way of implementing for-next loops with the count specified by an auxiliary reg- 
ister. 


Software Applications 


Logical and Arithmetic Operations 


Three-level-deep nested loops can be efficiently implemented by these three 
instructions with each instruction controlling one loop. The following example 
implements this nested code structure to do N-by-N matrix multiplication. Note 
the use of BANZD (delayed BANZ) instruction to avoid flushing the instruction 
pipeline. Also, note the use of MADS (multiply-accumulate using BMAR) in- 
struction to dynamically switch between the rows of matrix A to compute the 
elements of the product matrix C. 


Example 7—8.Using Nested Loops 


etitle "NxN Matrix Multiply Routine” 


-mmregs 


* 


* This routine performs multiplication of two NxN matrices. 


* A x B = C where A,B, and C are NxN in size. 
* Entry Conditions: 
* AR1 —> element (0,0) of A (in program space) 
* AR2 —> element (0,0) of B (in data space) 
* AR3 —> element (0,0) of C (in data space) 
* DP = QO, NDX = 1 
* ARP = 2 
* Storage of matrix elements in memory (beginning from low 
* memory) : 
: M(0,0),-.--,M(0,N—1),M(1,0),.-..,M(N—1,N—1) 
MTRX_MPY: 
LAR ARO ,#(N-1) ;set up loop count 
SPLK #N, INDX ;row size 
SAR AR2,AR4 *Save addr of B 
* sfor i=0,i<N,++i 
LOOP1: SMMR AR1,BMAR ;BMAR —> A(i,0) 
SPLK #(N-—1),BRCR ;setup loop2 count 
SAR AR4,AR5 sAR5 —> B(0,0) 
LOOP2: RPTB  ELOOP2 sfor j=0,j<N,++3 
SAR AR5 , AR2 sAR2 —> B(0,3) 
LOOP3: RPTZ #(N-—1) sfor k=0,k<N,++k 
ELOOP3 :MADS *0+ ;Acc=A(i,k)xB(k,3j) 
APAC *Final accumulation 
MAR * ,AR5 ;ARp = ARS 
MAR *+,AR3 sAR5 —> B(0,3+1) 
ELOOP2 : SACL *+,0,AR2 ;Save C(i,j) 
MAR * ,ARO sloop back if 
BANZD LOOP1,*—,ARI1 scount != N 
ADRK N sAR1 —> A(it+1,0) 
ELOOP1: MAR * ,AR2 ;ARp = AR2 
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7.5 Circular Buffers 
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Circular addressing is an important feature of the ’'C5x instruction set. Algo- 
rithms like convolution, correlation, and FIR filters can make use of circular 
buffers in memory. The ’C5x supports two concurrent buffers operating via the 
auxiliary registers. These five memory-mapped registers control the circular 
buffer operation: CBSR1, CBSR2, CBER1, CBER2, CBCR. See subsection 
4.1.6 for details. 


The start and end addresses must be loaded in the corresponding buffer regis- 
ters before the circular buffer is enabled. Also, the auxiliary register that acts 
as a pointer to the buffer must be initialized with the proper value. 


Example 7-9 illustrates the use of a circular buffer to generate a digital sine 
wave. A 256-word sine-wave table is loaded in the B1 block of dual-access in- 
ternal data memory from external program memory. Accessing the internal 
dual-access memory requires only one machine cycle. The block move ad- 
dress register (BMAR) is loaded with the ROM address of the table. The 
block-move instruction moves 256 samples of sine wave to internal data 
memory, which is then set up as a circular buffer. 


The start and end addresses of this circular buffer are loaded into the corre- 
sponding registers. The auxiliary register AR7 is also initialized to the begin- 
ning of the sine-wave table. Note the use of SAMM instruction to update AR7. 
This is possible because all auxiliary registers are memory-mapped at page 
0. Finally, the circular buffer #1 is enabled, and AR7 is mapped to that buffer. 
The other circular buffer is disabled. 


Whenever the next sample is to be pulled off from the table, postincrement in- 
direct addressing may be used with AR7 as the pointer. This ensures that the 
pointer will wrap around to the beginning of the table if the previous sample 
was the last one on the table. 
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Example 7-9. Use of Circular Addressing 


etitle ‘Digital Sine—Wave Generator’ 
smmregs 


RAKKKKKKKKKKKKKEKKKKKKKKK KKK KKK KKK KKKRKR KKK KKKKKKRKKRKR KK KKK KKK KKK 


* This routine illustrates the circular addressing capability of 

* TMS320C5x devices. A digital sine wave generator is implemented 
* as a circular buffer #1 with AR7 as its pointer. XSINTBL is the 
* location in external program memory where this table is stored. 
* It is moved to internal data memory block Bl where it is setup 

* 
* 


as a circular buffer. 
KKKKKKKKKKHAKEKEEKKKEKEKEEKKEKREREKKKEKRMEKKRKKRKEKKRRKRKRKRKRKKRKRKKKKKKKKKK KKK 


XSINTBL .set 03000h sprogram space address of sine table 
-text 


SINTBL LDP #0 
LAR ARO,#0300h ;address of Bl block 


MAR *,ARO 
LACC #XSINTBL ;get sine table address in 
* ;external program memory 
SAMM BMAR ;load source register 
* 
RPT #255 smove 256—word 
BLPD BMAR, *+ s;load table from external program 
* smemory to internal data memory 
SAMM CBSR1 sstart address of buffer=300h 
SAMM AR7 ;AR7 points to start of buffer 
ADD #255 
SAMM CBER1 send address of buffer=3ffh 


SPLK #0Fh,CBCR senable CB¥1, disable CB#2 
; ;pointer for CB#1 is AR7 


NXTSMP MAR *,AR7 
LACC *+ ;get next sample from table 
;AR7 is updated to next valid sample 


DISBLE APL #0FFF7h,CBCR ;Disable CB#¥1 


° 


RET 


If the step size must be greater than one, check to see if an update to the auxil- 
iary register generates an address outside the range of the circular buffer. This 
may happen if the same sine table is used to generate sine waves of different 
frequencies by changing the step size. Modulo addressing can avoid such 
problems. A simple way to perform modulo addressing on ’C5x devices is to 
use the APL and OPL instructions. For example, to implement the modulo-256 
counter, first load the DBMR (dynamic bit manipulation register) with 255 (the 
maximum value allowed); when the auxiliary register is updated (by any 
amount), it is ANDed with the DBMR register and ORed with the start address 
of the buffer. The start address of the modulo-2* buffer must have zeros in the 
kLSBs. Hence, for modulo-256 addressing, the first 8 LSBs of the start register 
must be zero. 
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The following code does modulo-256 addressing: 


START .set 04000h ;start address of the buffer 
LDP #0 
LACL #0FFh 
SAMM DBMR smax value = 255 
MAR *0+ sincrement AR7 by some amount 
APL AR7 sextract lower 8 bits 
OPL #START,AR7 ;add the start address 
Software Applications 
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7.6 Single-Instruction Repeat (RPT) Loops 


The 'C5x provides two different types of repeat instructions. The repeat block 
RPTB instruction implements code loops that can be 3 to 65536 words in size. 
These loops do not require any additional cycles to jump from the end-of-block 
to the start-of-block address at the end of each iteration. In addition, these 
zero-overhead loops are interruptible so that they can be used in background 
processing without affecting the latency of time-critical tasks. 


On the other hand, the single-instruction repeat RPT pipelines the execution 
of the next instruction to provide a high-speed repeat mode. A 16-bit repeat 
counter RPTC allows execution of a single instruction 65536 times. When this 
repeat feature is used, the instruction being repeated is fetched only once. As 
a result, many multicycle instructions, such as MAC/MACD, BLDD/BLDP, or 
TBLR/TBLW, become single-cycle when repeated. 


Some of ’C5x instructions behave differently in the single-instruction repeat 
mode to efficiently utilize the 'C5x multiple-bus architecture. The following in- 
structions fall in this category: 


BLDD, BLDP, BLPD, IN, OUT, MAC, MACD, MADS, MADD, TBLR, TBLW, 
LMMR, SMMR 


Because the instruction is fetched and internally latched when in single-in- 
struction repeat mode, the program bus is used by these instructions to read 
or write a second operand in parallel to the operations being done using the 
data bus. With the instruction latched for repeated execution, the program 
counter is loaded with the second operand address (which may be in data, pro- 
gram, or |/O space) and incremented on succeeding executions to read/write 
in successive memory locations. As an example, the MAC instruction fetches 
the multiplicand from the program memory via the program bus. Simulta- 
neously with the program bus fetch, the second multiplicand is fetched from 
data memory via the data bus. In addition to these data fetches, preparation 
is made for accesses in the following cycle by incrementing the program count- 
er and by indexing the auxiliary register. IN instruction is another example of 
an instruction that benefits from simultaneous transfers of data on both the 
program and data buses. In this case, data values from successive locations 
in 1/O space may be read and transferred to data memory. For complete details 
of how the above-listed instructions behave in repeat mode, see the individual 
description of each instruction in Chapter 4. 


The following example demonstrates the implementation — of 
memory-to-memory block moves on the ’C5x using single-instruction repeat 
(RPT) loops. 
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uction Repeat Loops 


Example 7-10. Memory-to-Memory Block Moves Using RPT 
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-mmregs 
text 


* 
* This routine uses the BLDD instruction to move external 


‘* data memory to internal data memory. 


* 


MOVEDD: 
LACC #4000h 
SAMM BMAR ;BMAR —> source in data memory. 
LAR AR7,#100h ;AR7 —> destination in data memory 
MAR *,AR7 ;LARP = AR7. 
RPT #1023 sMove 1024 value to blocks BO and Bl 
BLDD BMAR, *+ 
RET 
* 
* This routine uses the BLDP instruction to move external 
* data memory to internal program memory. This instruction could be 
* used to boot load a program to the 8K on chip program memory from 
* external data memory. 
* 
MOVEDP: 
LACC #800h 
SAMM , BMAR ;BMAR —> destination in program memory 
LAR AR7,#0E000h ;AR7 —> source in data memory. 
RPT #8191 ;Move 8K to program memory space. 
BLDP *+ 
RET 
* 
* This routine uses the BLPD instruction to move external 
* program memory to internal data memory. This routine 
* is useful for loading a coefficient table stored in 
* external program memory to data memory when no external 
* data memory is available. 
* 
MOVEPD: 
LAR AR7 ,#100h ;AR7 —> destination in data memory. 
RPT #127 ;Move 128 values from external program 
BLPD #3800h, *+ ;to internal data memory BO. 
RET 
* 
* This routine uses the TBLR instruction to move program 
* memory to data memory space. This differs from the BLPD 
* instruction in that the accumulator contains the address 
* in program memory from which to transfer. This allows 
* for a calculated, rather than pre-determined, location in 
* program memory to be specified. 
* 
TABLER: 
MAR *,AR3 ;AR3 —> destination in data memory. 
LAR AR3 ,#300h 
RPT #127 ;Move 128 items to data memory block Bl 
TBLR *+ 
RET 
* 
* This routine uses the TBLW instruction to move data memory 
* to program memory. The calling routine must contain the destination 
* program memory address in the accumulator. 
* 
TABLEW: 
MAR *,AR4 ;ARP = AR4. 
LAR AR4,#380h ;AR4 —> source address in data memory. 
RPT #127 sMove 128 items from data memory to 
TBLW *+ ;program memory. 
RET 
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This routine uses the SMMR instruction to move data 
from a memory—mapped I/O port to local data memory. 
Note that 16 I/O ports are mapped in data page 0 of 
the ‘C5x memory map. 


* 
* 
* 
* 
* 
* 


INPUT: 
LDP #0 
RPT #511 ;Input 512 values from port 51h to 
SMMR 51h,800h stable at 800h in data memory. 
RET 


* 

* This routine uses the LMMR instruction to move data from 
* local data space to a memory-mapped I/O port. Note that 
* 16 I/O ports are mapped in data page 0 of TMS320C5x 


* memory map. 
* 
OUTPUT: 
LDP #0 ;data page 0 
RPT #63 ;Output 64 values from a table at 800h 
LMMR 50h,800h sin data memory to port 50h. 
RET 
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7.7 Subroutines 


Example 7-11 illustrates the use of a subroutine to determine the square root 
of a 16-bit number. The main routine executes to the point where the square 
root of anumber should be taken. At this point, a delayed call (CALLD) is made 
to the subroutine, transferring control to that section of the program memory 
for execution and then returning to the calling routine via the delayed return 
(RETD) instruction when execution has completed. 


This example shows several features of ’C5x instruction set. In particular, note 
the use of delayed-call (CALLD), delayed-return (RETD), and conditional-exe- 
cute (XC) instructions. Due to the four-level-deep pipeline on 'C5x devices, 
normal branch instructions require 4 cycles to execute. Using delayed 
branches, only two cycles are required for execution. The XC instruction is 
useful where only one or two instructions are to be executed conditionally. In 
this example, notice how XC is used to avoid extra cycles due to branch in- 
struction. Use of the XC instruction also helps in keeping the execution time 
of a routine constant, regardless of input conditions. This is because XC ex- 
ecutes NOPs in place of instructions if conditions are not met. 


Example 7-11. Square Root Computation Using XC 
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Autocorrelation 

This routine performs a correlation of two vectors and then 
calls a Square Root subroutine that will determine the RMS 
amplitude of the wave form. 


+ + + + 


AUTOC 


CALLD SQRT ;Call square root subroutine after 
MAR * , ARO sexecuting next two instructions 
LACC * ;Get the value to be passed to SQRT 

‘ ssubroutine 


Square Root Computation 


+ +£ + & € 


This routine computes the square root of a number that is located 
in the lower half of accumulator. The number is in Q15 format. 

* 

BRCR -set 09h >DP=0 

STO -set 60h :Internal RAM block B2 

STl -set 61h 

NUMBER .set 62h 

TEMPR .set 63h 

GUESS .set 64h 


text 
SQRT SST #0,STO 
SST #1,ST1 *Save context 
LDP #0 
SETC SXM sSet SXM=1 
SPM 1 *Set PM mode for fractional arithmetic 
SACL NUMBER :Save the number 
LACL #0 
SACB sClear accumulator buffer 
SPLK #11,BRCR sinitialize for 12 iterations 


SPLK #800h,GUESS ;Set initial guess 


Software Applications 


LACC 
SUB 
BCNDD 
SPLK 
LACC 
SACL 
SACL 
SPLK 
LOOP RPTB 
SQRA 
LACC 
SPAC 
NOP 
XC 
LACC 
SACB 
LACC 
SACH 
ADDB 
SACH 
ENDLP LACB 
RETD 
LST 
LST 


Subroutines 


NUMBER 
#200h 
LOOP , LT sI£f NUMBER<200h then begin looping 


#800h, TEMPR 

#4000h ;Otherwise set initial guess 
GUESS sand temporary root to 4000h 
TEMPR 


#14,BRCR ;and increase iterations to 15 
ENDLP—1 ;Repeat block 

TEMPR ;Square temporary root 
NUMBER, 16 


s ACC=NUMBER~—TEMPR** 2 
:Dead cycle for XC 
2,GT ;If NUMBER>TEMPR**2 skip next 2 instr. 
TEMPR, 16 
;Otherwise ROOT <— TEMPR 
GUESS,15 
GUESS s;GUESS <— GUESS/2 
TEMPR sTEMPR <— GUESS+ROOT 
;High Acc contains square root of NUMBER 
#1,ST1 
#0,STO ;Restore context 


Note that the restore is done with the LST instruction to prevent ARP from be- 
ing overwritten. If indirect addressing is used, the order is reversed. 


7-19 


Extended-Precision Arithmetic 


7.8 Extended-Precision Arithmetic 


Numerical analysis, floating-point computations, or other operations may re- 
quire arithmetic to be executed with more than 32 bits of precision. Since the 
‘C5x devices are 16/32-bit fixed-point processors, software is required for the 
extended precision of arithmetic operations. Subroutines that perform the ex- 
tended-arithmetic functions for 'C5x are provided in the examples of this sec- 
tion. The technique consists of performing the arithmetic by parts, similar to 
the way in which longhand arithmetic is done. 


The ’C5x has several features that help make extended-precision calculations 
more efficient. One of the features is the carry bit. This bit is affected by all arith- 
metic operations of the accumulator, including addition and subtraction with 
the accumulator buffer. This allows 32-bit-long arithmetic operations using the 
accumulator buffer as the second operand. 


The carry bit is also affected by the rotate and shift accumulator instructions. 
It may also be explicitly modified by the load status register ST1 and the set/re- 
set control bit instructions. For proper operation, the overflow mode bit should 
be reset (OVM = 0) so that the accumulator results is not loaded with the satu- 
ration value. 


7.8.1 Addition and Subtraction 
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The carry bit is set whenever the addition of a value from the input scaling shift- 
er, the P register, or the accumulator buffer to the accumulator contents gener- 
ates a carry out of bit 31. Otherwise, the carry bit is reset because the carry 
out of bit 31 is a zero. One exception to this case is the addition to the accumu- 
lator with a shift of 16 instruction (ADD mem,16), which can only set the carry 
bit. This allows the ALU to generate a proper single carry when the addition 
either to the lower or the upper half of the accumulator actually causes the 
carry. The following examples help to demonstrate the significance of the carry 
bit of the ’C5x for additions: 
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Figure 7—1. 32-Bit Addition 


C MSB LSB C MSB LSB 

X FFFFFF F F ACC X FFFFFFFF ACC 
+ +F FFFFFFEFE 

1 00000000 1 FFFFFFFE 

Cc MSB LSB C MSB LSB 

X 7FFFFEFFF F ACC X 7FFFFFFF ACC 
+ + 

0 80000000 1 7FFFFFFE 

C MSB LSB C MSB LSB 

xX 8000000 0 ACC 1 80000000 ACC 

ES, a eS! mama SH? Se <A! eS 

0 80000001 1 7FFFFFFFF 

C MSB LSB C MSB LSB 

1 00000000 acc 1 FFFFFFF F ACC 

a I cL ADDE) Ss ae ete 

{ADDC) 

0 00000001 1 0000000 0 

C MSB LSB Cc MSB LSB 

1 8000F FFF ACC 1 8O0O00F FFF ACC 
+ 1 1 —t/FEFFOOO O 

1 8000FFFF 1 FFFFFFFF 


Example 7—12 shows an implementation of two 64-bit numbers added to each 


other to obtain a 64-bit result. 


Example 7-12. 64-Bit Addition 


* 

* 

* 

* 

* 

* 

* below. 

* 

be X3 X2 X1 X0 

* + Y3 Y2 Y1 YO 

* ee 

* W3 W2 Wl WO —OR— ACC ACCB 

* 

ADD64 #LACC X1,16 ;ACC = 
ADDS X0 ;ACC = 
ADDS Yo ;ACC = 
ADD Y¥1,16 ;ACC = 
SACL wo ; THESE 
SACH Wil 
LACC X3,16 ;ACC = 
ADDC X2 ;ACC = 


X1 00 
X1 X0 
X1 x0 + 00 YO 
X1 x0 + Y1 YO 


Two 64-bit numbers are added to each other producing a 
64—bit result. The number X (X3,X2,X1,X0) and Y 
(Y3,Y¥2,Y¥1,Y0) are added resulting in W (W3,W2,W1,W0). 
If the result is required in 64—bit ACC/ACCB pair, 
replace the instructions as indicated in the comments 


2 INSTR ARE REPLACED BY 


X3 00 
X3 X2 + C 


;”SACB” IF RESULT IS DESIRED IN (ACC ACCB) 
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ADDS Y2 ;ACC = X3 X2 + 00 Y2+C 

ADD Y¥3,16 ;ACC = X3 X2 + Y3 Y2+C 

SACL W2 ;THESE 2 INSTR ARE NOT REQUIRED IF 
SACH W3 ;THE RESULT IS DESIRED IN (ACC ACCB) 
RET 


As in addition, the carry bit on the ’'C5x is reset whenever the input scaling shift- 
er, the P register, or the accumulator buffer value subtracted from the accumu- 
lator contents generates a borrow into bit 31. Otherwise, the carry bit is set be- 
cause no borrow into bit 31 is required. One exception to this case is the SUB 
mem, 16 instruction, which can only reset the carry bit. This allows the genera- 
tion of the proper single carry when the subtraction from either the lower or the 
upper half of the accumulator actually causes the borrow. The examples in 
Figure 7-2 demonstrate the significance of the carry bit for subtraction. 


Figure 7—2. 32-Bit Subtraction 


C MSB LSB C MSB LSB 
xX 00000000 ACC xX 00000000 
ACC 
—_ 1 —F FFFFFFEFE 
0 FFFFFFFF 0 00000001 
C MSB LSB C MSB LSB 
X 7 F FF F F F F ACC X 7 FFFFFFF 
ACC 
ee ee eee ___F_ F F FF FFF 
1 7 FFFFFFE Cc 8000000 0 
C MSB LSB C MSB LSB 
X 8 0000000 ACC X 80000000 
ACC 
ae __ oF FFFFFFE 
1 7F FFFFFF 0 80000001 
C MSB LSB C MSB LSB 
0 00000000 ACC 0 FFFFFFFF 
ACC 
_ - (iti SU BB) CD 
(SUBB ) 
0 FFFFFFFF 1 FF FFFFFE 
C MSB LSB C MSB LSB 
0 8 000FF F F ACC 0 8 OO OFFFF 
ACC 
__-0 001000 0 (SUB mem,16)  _ #--F FFFOO0 90 
B mem 


0 7 FFFFFFF 0 80 01FFFF 


Example 7—13 implements the subtraction of two 64-bit numbers on the ’C5x. 
A borrow is generated within the accumulator for each of the 16-bit parts of the 
subtraction operation. 
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Example 7-13. 64-Bit Subtraction 


7.8.2 Multiplication 


* 
Two 64-bit numbers are subtracted, producing a 64—bit 


* result. The number Y (Y3,Y2,Y1,Y0) is subtracted from 

* X (X3,X2,X1,X0) resulting in W (W3,W2,W1,W0). 

* If the result is required in 64—bit ACC/ACCB pair, 

* replace the instructions as indicated in the comments 

* below. 

* 

*  X3 X2 X1 x0 

* — Y3 Y2 Y1 YO 

* LS 

* W3 W2 Wl WO —OR— ACC ACCB 

* 

SUB64 LACC X1,16 : ACC = X1 00 
ADDS x0 ; ACC = X1 X0 
SUBS YO s ACC = X1 XO — 00 YO 
SUB Y¥1,16 ; ACC = X1 xO — Y1 YO 
SACL wo ; THESE 2 INSTR ARE REPLACED BY 
SACH W1 ; “SACB” IF RESULT IS DESIRED IN (ACC ACCB) 
LACL X2 ; ACC = 00 X2 
SUBB Y2 s ACC = 00 X2 — 00 Y2 —C 
ADD X3,16 s ACC = X3 X2 — 00 Y2-—C 
SUB Y¥3,16 ; ACC = X3 X2 — Y3 Y2-—C 
SACL W2 ; THESE 2 INSTR ARE NOT REQUIRED IF 
SACH W3 ; THE RESULT IS DESIRED IN (ACC ACCB) 
RET 


Another important feature that aids in extended-precision calculations is the 
MPYU (unsigned multiply) instruction. The MPYU instruction allows two un- 
signed 16-bit numbers to be multiplied and the 32-bit result placed in the prod- 
uct register in a single cycle. Efficiency is gained by generating partial products 
from the 16-bit portions of a 32-bit or larger value instead of having to split the 
value into 15-bit or smaller parts. 


Further efficiency is gained by using the accumulator buffer to hold partial re- 
sults instead of using a temporary location in data memory. The ability of ’C5x 
devices to barrel-shift the accumulator by 1 to 16 bits in only one cycle is also 
useful for scaling and justifying operands. 


For 16-bit integer multiplication, in which one operand is a 2s-complement 
signed integer and the other one is an unsigned integer, you can use the algo- 
rithm shown in Figure 7-3. 
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Figure 7-3. 16-Bit Integer Multiplication 


x 


- 


Signed Integer 


- 


Unsigned Integer 


x< 
x 
< 


Signed Multiplication 


- 


Add X if Y 15 = 1 


x< 
x 
~< 


Final 32-Bit Result 


Steps required: 
1) Multiply two operands X and Y as if they are signed integers. 


2) If MSB of the unsigned integer Y is 1, add X to the upper half of the 32-bit 
signed product. 


The correction factor must be added to the signed multiplication result be- 
cause the bit weight of the MSB of any 16-bit unsigned integer is 2'°. 


Consider following representation of a signed integer X and an unsigned inte- 
ger Y: 


X= —215x,., + 214x544 + 213x,4, F260 2'x, + 20x, 
Y= 215y.. + 2l4y44 + 2'3y 144 +... + 2ly, + 20v9 
Multiplication of X and Y yields: 
XxY = X x (2M yy5 4+ QWMy14 + Qlyig +... + Qlyy + Wyo) 
= 2'5y. 6X + 2141 4X + 2'8y15X + oes + 2!y4X + 20y9X (1) 
However, if X and Y are considered signed integers, their multiplication yields: 
XxY =X x (-2'5y,. + 2'4y44 + 213y 49 +. 2'y, + 20y0) 


= —219y 5X + 2'4y1 4X + 21yigX +... + DlyyX + VOX (2) 
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The difference between (1) and (2) is in the first term on the right-hand side 
of the two equations. 


Hence, if we add the correction term, 2'®y,<X, to equation (2), the result would 
be identical to that of equation (1) and is the correct result. 


This method of multiplying a signed integer with an unsigned integer can be 
used to implement extended-precision multiplication on ’C5x. The following 
description of a 32-bit multiplication algorithm is based on this method: 


Figure 7-4. 32-Bit Multiplication Algorithm 


XO x YO 


Unsigned Multiplication 


X1 x YO 


Signed Multiplication 


XO x Y1 


Signed Multiplication 
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If MSB of YO = 1 


E 


If MSB of XO = 1 


X1 x Y1 


+ Signed Multiplication 


Final 64-Bit Result 
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The following example implements this algorithm. The product is a 64-bit inte- 
ger number. Note in particular, the use of BSAR and XC instructions. 


Example 7-14. 32-Bit Integer Multiplication 


-title "32—bit Optimized Integer Multiplication” 
def MPY32 


* 
* This routine multiplies two 32—bit signed integers result— 
* ing in a 64-bit product. The operands are fetched from 
* data memory and the result is written back to data memory. 
* Data Storage: 
* X1,xX0 32—bit operand 
* Y1,yY0 32-bit operand 
* W3,W2,W1,W0 64—bit product 
* Entry Conditions: 
* DP = 6, SXM = 1 
* OVM = 0 
* 
X1 -set 300h :DP=6 
x0 .set 301h ;DP=6 
Yl -set 302h >DP=6 
YO -set 303h :DP=6 
W3 -set 304h > DP=6 
W2 set 305h ;DP=6 
Wl1 -set 306h :DP=6 
wo set 307h ;DP=6 
text 
MPY32: 
BIT X0,0 ;TC = XO bit#15 
LT X0 7T = XO 
MPYU YO ;P = X0OYO 
SPL wo :Save WO 
SPH W1 ;Save partial W1 
MPY Yl 7;P = X0OY1 
LTP X1 s;Acc = X0Y1, T = Xl 
MPY Y0 ;P = X1Y0 
MPYA Yl sAcc = X0Y1+X1Y0, P=X1Y1 
ADDS W1 ;Acc = X0Y1+X1Y0+X0Y02*—16 
SACL Wl sSave final Wl 
BSAR 16 ;Shift Acc right by 16 
XC 1,TC ;If MSB of XO is 1 
ADD Yl :Add Yl 
BIT Y¥O,0O ;TC = YO bit#¥15 
APAC sACC = X1Y1 + (XOY1+X1Y0)2*-16 
XC 1,TC ;IF MSB of YO is l 
ADD X1 sAdd Xl 
SACL wW2 Save W2 
SACH W3 ;Save W3 
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The next example performs fractional multiplication. The operands are in Q31 
format, while the product is in Q30 format. 


Example 7-15. 32-Bit Fractional Multiplication 


etitle "32-bit Fractional Multiplication” 


i i i eee 
; This routine multiplies two Q31 signed integers resulting 
; in a Q30 product. The operands are fetched from data 

; memory and the result is written back to data memory. 

; Data Storage: 
; 
a 
4 
a 
a 


> If MSB of YO is 1 
> then add X1 

; Save lower product 
> Save upper product 


X1,X0 Q31 operand 
Y1,Y0 Q31 operand 
W1,Ww0 Q30 product 
> Entry Conditions: 
DP = 6, SXM = 1 
; OVM = 0 
POT T TTT TTT TTR TOT TET ETE TTT EPP P TOPO PT EET OP TPT T OPPOSE PEEOPEOS 
X1 -set 300h ;DP=6 
X0 -set 301h ;DP=6 
Y1 -set 302h ;DP=6 
Yo set 303h ;DP=6 
W1 -set 304h ;DP=6 
wo -set 305h :DP=6 
text 
BIT X0,0 ; TC = XO bit#15 
LT X0 ; TREGO = X0 
MPY Y1 ; P = X0*Y0 
LTP X1 ; Acc = X0*Y0 
MPY x0 ; P = X1*Y0 
MPYA Yl * Acc = X0*YO + X1*YO 
BSAR 16 ; Throw away low 16 bits 
XC 1,TC ; If MSB of XO is 1 
ADD Yl ; then add Yl 
BIT YO,O ; TC = YO bit#15 
APAC ; Acc = Acc + X1*Y1 
e 
a 
c 


7.8.3 Division 


Integer and fractional division is implemented on the 'C5x by repeated subtrac- 
tions executed with SUBC, a special conditional subtract instruction. Given a 
16-bit positive dividend and divisor, the repetition of the SUBC command 16 
times produces a 16-bit quotient in the low accumulator and a 16-bit remainder 
in the high accumulator. 


SUBC implements binary division in the same manner as long division is done. 
The dividend is shifted until subtracting the divisor no longer produces a nega- 
tive result. For each subtract that does not produce a negative answer, a one 
is put in the LSB of the quotient and then shifted. The shifting of the remainder 
and quotient after each subtract produces the separation of the quotient and 
remainder in the low and high halves of the accumulator. 


Both the dividend and the divisor must be positive when using the SUBC com- 
mand. Thus, the sign of the quotient must be determined and the quotient com- 
puted by using the absolute value of the dividend and divisor. 
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Integer and fractional division can be implemented with the SUBC instruction 
as shown in Example 7-16 and Example 7-17, respectively. When imple- 
menting a divide algorithm, it is important to know if the quotient can be repre- 
sented as a fraction and the degree of accuracy to which the quotient is to be 
computed. For integer division, the absolute value of the numerator must be 
greater than the absolute value of the denominator. For fractional division, the 
absolute value of the numerator must be less than the absolute value of the 
denominator. 


Long Division: 
0000000000001 10 Quotient 
0000000000000101 
—101 
T{10 
—101 
11 Remainder 
SUBC Method: 
32 HIGH ACC LOW ACC 0 Comment 


0000000000000000 0000000000100001 (1) Dividend is loaded into ACC. The di- 
-10 1000000000000000 visor is left-shifted 15 and subtracted 
OO OOTTTTTTTTIOTTTTT from ACC. The subtraction is nega- 
tive, so discard the result and shift 
left the ACC one bit. 


| | | 

0000000000000000 0000000001000001 (2) 2nd subtract produces negative an- 
-10 1000000000000000 swer, So discard result and shift ACC 
—10  OTTTTTITIOTTITIO (dividend) left. 


| | | 

0000000000000100 0010000000000000 (14) 14th SUBC command. The result is 
-10 1000000000000000 positive. Shift result left and replace 

“000000000000000T ~—1010000000000600 LSB with 1. 


0000000000000011 0100000000000001 (15) Result is again positive. Shift result 
-10 1000000000000000 left and replace LSB with 1. 
“00060000000000000 ~~ 11000600000000007 
0000000000000001 1000000000000011 (16) Last subtract. Negative answer, so 
-10 1000000000000000 discard result and shift ACC left. 
Tm ETT TTT TT TTT IOT 
“00000000000000TT ~0000000000000TT0 Answer reached after 16 SUBC in- 


structions. 
| Remainder =| | Quotient | 
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Example 7-16. Integer Division Using SUBC 


+e + + 4+ © & &¢ F&F HF HF F 


This routine implements integer division with the SUBC instruction. For this 
integer division routine, the absolute value of the numerator must be greater 
than the absolute value of the denominator. In addition, the calling routine 
must check to verify that the divisor does not equal 0. 


The 16—bit dividend is placed in the low accumulator, and the high accumulator 
is zeroed. The divisor is in data memory. At the completion of the last 
SUBC, the quotient of the division is in the lower—order 16—bits of the 
accumulator. The remainder is in the higher-order 16—bits. 


Key C5x Instruction: 
RETCD return if conditions true — after executing next 2—word instruction or 
two single—word instructions 


DENOM .set 60h 
NUMERA .set 61h 
QUOT -set 62h 
REM -set 63h 
TEMSGN .set 64h 


* 


INTDIV LDP #0 
LT NUMERA ;Determine sign of quotient. 
MPY DENOM 
* 
SPH TEMSGN ;Save the sign 
LACL DENOM 
ABS ;Make denominator and numerator positive. 
SACL DENOM sSave absolute value of denominator 
LACL NUMERA 
ABS 


* 
* 
* 


If divisor and dividend are aligned, division can start here. 


RPT #15 :16 cycle division. Low accumulator contains 
SUBC DENOM sthe quotient and high accumulator contains the 

sremainder at the end of the loop. 

BIT TEMSGN, 0 :Test sign of quotient. 

RETCD NTC ;Return if sign positive, else continue. 
SACL QUOT ;Store quotient and remainder during delayed 
SACH REM ;return. 

LACL #0 ;If sign negative, negate quotient 

RETD sand return 
SUB QUOT 
SACL QUOT 
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Example 7-17. Fractional Division Using SUBC 


This routine implements fractional division with the SUBC instruction. For 
this division routine, the absolute value of the denominator must be 


calling routine must check to verify that the divisor does not equal 0. 


The 16-bit dividend is placed in the high accumulator, and the low accumulator 


;Determine sign of quotient. 


* 
* 
* 
* greater than the absolute value of the numerator. In addition, the 
* 
* 
* 
* is zeroed. The divisor is in data memory. 
* 
DENOM -set 60h 
NUMERA set 61h 
QUOT set 62h 
REM -set 63h 
TEMSGN -set 64h 
* 
FRACDIV LDP #0 
LT NUMERA 
* 
MPY DENOM 
SPH TEMSGN 
LACL DENOM 
ABS 


SACL DENOM 
LACC NUMERA, 16 
ABS 


* 


* If divisor and dividend are 


RPT #15 
SUBC DENOM 


BIT TEMSGN, 0 

RETCD NTC 

SACL QUOT 
SACH REM 

* 

LACL #0 

RETD 
SUB QUOT 
SACL QUOT 
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sMake denominator and numerator positive. 


;Load high accumulator, zero low accumulator. 


aligned, division can start here. 


:16-cycle division. Low accumulator contains 
sthe quotient and high accumulator contains the 
;remainder at the end of the loop. 


;Test sign of quotient. 

;Return if sign positive, else continue. 
;Store quotient and remainder during delayed 
;return. 


sIf sign negative, negate quotient 
sand return 
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7.9 Floating-Point Arithmetic 


To implement floating-point arithmetic on the ’C5x, operands must be con- 
verted to fixed point for arithmetic operations and then converted back to float- 
ing point. Conversion to floating-point notation is performed by normalizing the 
input data. 


To multiply two floating-point numbers, the mantissas are multiplied and the 
exponents added. The resulting mantissa must be renormalized. Float- 
ing-point addition or subtraction requires shifting the mantissa so that the ex- 
ponents of the two operands match. The difference between the exponents is 
used to left-shift the lower power operand before adding. Then, the output of 
the add must be renormalized. 


The ’C5x instructions used in floating-point operations are NORM, SATL, 
SATH, and XC. NORM may be used to convert fixed-point numbers to float- 
ing-point. SATL in combination with SATH provides a two-cycle 0—31-bit right 
shift. XC helps avoid extra cycles caused by branch instructions. 


Example 7-18 and Example 7—19 show how to implement floating-point arith- 
metic on ’C5x devices. Floating-point numbers are generally represented by 
mantissa and exponent values. Single-precision IEEE floating-point numbers 
are represented by a 24-bit mantissa, an 8-bit exponent, and asign bit. In order 
to simplify the routines, a format slightly different from the IEEE format is used. 
Four words are occupied by each floating-point number. One sign word, one 
word for exponent, and two words for mantissa are reserved in memory as de- 
scribed in the code below. 


Example 7—18. Floating-Point Addition Using SATL and SATH 


+e ¢ + £¢ e+e ete + He + Ht SH He EH SF EH HF HF HF 


-title ‘Floating Point Addition Algorithm’ 
def FL ADD 


tt A A i A i 


THIS SUBROUTINE ADDS TWO FLOATING—POINT NUMBERS PRODUCING 
A NORMALIZED FLOATING-POINT PRODUCT. THE FORMAT OF FLOATING— 
POINT NUMBERS IS SPECIFIED BELOW. 


INPUT / OUTPUT FORMAT 


| ALL 0 OR 1 | SIGN WORD 

| 16 BITS | EXPONENT 

|0| 15 BITS | HIGH PART OF MANTISSA 
| 16 BITS | LOW PART OF MANTISSA 


Key C5x Instructions: 
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TC=1 if two values are same 
TC=0 otherwise 


* 

* SAMM save the accumulator contents in a memory—mapped 
* register 

*  LACB accumulator is loaded with contents of accumulator 
* buffer 

* SACB contents of accumulator are copied in accumulator 
* buffer 

* SATL accumulator is barrel—shifted right by the value 
* specified in the 4 LSBs of TREG1 

*  SATH accumulator is barrel—shifted right by 16 bits 

* if bit 4 of TREG1 is a one. 

*  SPLK store immediate long constant in data memory 

* CPL compare long immediate value (or DBMR) with data 

* memory 

* 

* 

* 


TREG1 -set Odh 


ASIGN .set 60h ;Sign, exponent, high and low part of mantissa 
AEXP -set 61h ;of input number A 
AHI -set 62h 
ALO -set 63h 
BSIGN .set 64h ;Sign, exponent, high and low part of mantissa 
BEXP -set 65h ;Oof input number B 
BHI -set 66h 
BLO -set 67h 
CSIGN .set 68h :;Sign, exponent, high and low part of mantissa 
CEXP -set 69h ;0f the resulting floating point number C 
CHI -set 6Ah 
CLO -set 6Bh 
DIFFEXP .set 6Ch 
-text 
FL ADD LDP #0 ;Initialization 
SETC SXM 7Set sign extension mode 
MAR * ,ARO sARP <— ARO 
LAR ARO, #0 ;ARO is used by NORM instruction 
CMPEXP LACL BLO sLoad low Acc with BLO 
ADD BHI,16 ;Add BHI to high Acc 
SACB ;AccB = BHIBLO 
LACC AEXP 
SUB BEXP ;Acc = AEXP=BEXP 
SACL DIFFEXP sSave the difference 
BCND AEQB,EQ :If : == |B| 
BCND ALTB,LT ;If |A| < |B| 
AGTB ~~ LACC DIFFEXP ;If |A| > |B| 
SAMM TREG1 ;Load TREG1 with # of right shifts reqd. 
SUB #32 
BCND AGRT32,GEQ ;If difference > 32 
LACB ;Acc = BHIBLO 
SATL 
SATH ;Right justify BHIBLO 
SACB ;Store the result back in AccB 
AEQB LACC ASIGN ;Copy sign and exponent values of 
SACL CSIGN ;A in C (i.e. the result) 


CHKSGN LACC ASIGN s Acc=ASIGN—BSIGN 
SUB BSIGN 
CLRC TC ;Clear TC flag 
XC 1,LT :If A<0 and B>0 
SETC TC ;Set TC flag 
BCNDD ADNOW,EQ sIf both A and B have same sign 
LACL ALO 
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CZERO 


ADNOW 


NORMAL 


OUTPUT 


OVFLOW 


ALTB 


AHI, 16 


1,TC 


CZERO,EQ 
2,LT 


#0FFFFH,CSIGN 


#0,CSIGN 


NORMAL 
CHI 
CLO 


#0 
CEXP 
CSIGN 


CHI 
CLO 


OVFLOW, OV 
CHI 
CLO 
CZERO,EQ 


#0,CHI 


2, 2c 
CLO, 16 
ARO, #16 
2,NTC 
CHI,16 
CLO 
SXM 
2,LT 

1 


SXM 
#13 
++ 


CHI 

CLO 

CEXP 
ARO, CEXP 


CEXP 
CEXP 


SXM 


DIFFEXP 


TREG1 
#32 
BGRT32 ,GEQ 


sAcc = AHIALO 

; ACC=A-—B 

sIf A<O and B>0 

sthen Acc=B—-A 

sIf A-B == 0 

If A-B < 0 
then CSIGN=—1 


If A-—B<0 

then Acc=|A-B| 
;delayed branch 
;Save the result 


:If A-B == 0 

;then result is zero 
;Make sign positive 
;Return delayed 


sClear CHICLO 


;If signs are same 
sthen add two numbers 


sSave it in CHICLO 
;If CHICLO is zero, goto CZERO 


;Compare CHI with 0 

;Dead cycle for XC 

sIf CHI is 0 

;then normalize only the CLO part 
;ARO has exponent value 

;If CHI != 0 

;Acc=CHICLO 


sDisable sign extension mode 
sIf MSB of CLO is l 

;then shift right once 

;and decrement exponent. 
;Enable sign extension mode 
;Repeat 14 times 

sNormalize 


;Store high part 
;Store low part of the result 


;Save exponent 
;Return delayed 


; CEXP=CEXP—ARO 


;Disable sign extension mode 
;Shift Acc right 


sSave the result 


;Increment exponent by one 
;Save it 


;Copy sign of B inC 

;Copy exponent of B in C 

;Since A-B < 0 here 

;No. of shifts reqd. for right—justification 
;difference in exponent >= 32 


; ACcc=AHIALO 


7-33 


Extended-Precision Arithmetic 


SATH ;Right—justify ALOAHI 
BD CHKSGN ;Jump back after next two instructions 
SACL ALO ;Save normalized value 
SACH AHI ;in ALO and AHI 
BGRT32 LACC BHI ;If exponent of B > 32 
SACL CHI ;then C <— B. 
RETD sReturn after 
LACC BLO ;saving CHI and CLO 
SACL CLO 
AGRT32 LACC AHI ;If exponent of A > 32 
SACL CHI sthen C <— A. 
LACC ALO 
SACL CLO ;Copy ALO to CLO 
LACC ASIGN 
SACL CSIGN ;Copy ASIGN to CSIGN 
RETD :Return after 
LACC AEXP ;copying AEXP to CEXP 
SACL CEXP 


Example 7-19. Floating-Point Multiplication Using BSAR 


etitle ‘Floating Point Multiplication Routine’ 


THIS SUBROUTINE MULTIPLIES TWO FLOATING—POINT NUMBERS PRODUCING 
A NORMALIZED FLOATING—POINT PRODUCT. THE FORMAT OF FLOATING— 
POINT NUMBERS IS SPECIFIED BELOW. 


INPUT / OUTPUT FORMAT 


| ALL 0 OR 1 | SIGN WORD 

| 16 BITS | EXPONENT 

{o| 15 BITS | HIGH PART OF MANTISSA 
| 16 BITS | LOW PART OF MANTISSA 


NOTE THAT EVEN IF THE PRODUCT IS ZERO, SIGN OF THE PRODUCT MAY 
EITHER BE POSITIVE OR NEGATIVE DEPENDING ON THE INPUTS. 


Key C5x Instructions: 

BSAR 1~16 bit right barrel arithmetic shift in one cycle 

CLRC reset control bit 

SETC set control bit 

BD branch after executing next two one—word instructions 
or one two-word instruction 


+eene eee eee eee eH eee He HE He HF HH HH HE HE HE HE HE HF HS OF 


ASIGN .set 60h ;Sign, exponent, high and low parts of mantissa 
AEXP -set 61h ;0Of input number A 

AHI -set 62h 

ALO -set 63h 

BSIGN .set 64h ;Sign, exponent, high and low parts of mantissa 
BEXP -set 65h ;of input number B 

BHI -set 66h 

BLO -set 67h 

CSIGN .set 68h ;Sign, exponent, high and low parts of mantissa 
CEXP -set 69h ;of the resulting floating point number C 
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NZERO 


SIGN 


6ah 
6bh 


#0 
ARO, #0 
0 


AEXP 
BEXP 
CEXP 
SXM 
ALO 
BHI 
AHI 
BLO 
BHI 
16 


NZERO,NEQ 
CHI 

SIGN 

CLO 

CEXP 


*+ 

CHI 

CLO 

SXM 

CEXP 
ARO, CEXP 
CEXP 
CEXP 
ASIGN 


BSIGN 
CSIGN 


sARP <— ARO 
;Reset exponent counter 
*No left shift of P register 


;CEXP = AEXP + BEXP 

sfor barrel shift, disable sign extension 
;T = ALO 

;P = ALO*BHI 
;Acc=ALO*BHI, 
;P=AHI*BLO 
;Acc=ALO*BHI + AHI*BLO, P=AHI*BHI 
;Retain upper 16 bits plus 1 additional 
sbit due to zero MSBs of BLO & ALO 

;If the product is not zero 

;If the product is zero 

sthen clear CHI,CLO and CEXP 

;and jump to SIGN 


T=AHI 


;Discard additional sign bit (Q63) 
;Remove leading zero if any 
;Save product 


;Enable sign extension mode 

* CEXP<—ARO 

: CEXP=CEXP—ARO 

;If signs are same then product is +ve 


:Return after next two instructions 
sotherwise it is —ve. 
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7.10 Application-Oriented Operations 


7.10.1 Modem Application 


Digital signal processors are especially appropriate for modem applications. 
The ’C5x devices with their enhanced instruction set and reduced instruction 
cycle time are particularly effective in implementing encoding and decoding al- 
gorithms. Features like circular addressing, repeat block, and single-cycle bar- 
rel shift reduce the execution time of such routines. 


Example 7—20 implements a differential and convolutional encoder for a 9600- 
bit/s V.32 modem. This encoder uses trellis coding with 32 carrier states. The 
data stream to be transmitted is divided into groups of four consecutive data 
bits. The first two bits in time Q1, and Q2, in each group are differentially en- 
coded into Y1,, and Y2, according to the following equations: 


Y1p = Qi, © Y1n-4 

Y2n = (Q1, be Y1 n—1) ® Y2n-1 @® Q2, 
This is done by a subroutine called DIFF. The two differentially encoded bits 
Yin and Y2n are used as inputs to a convolutional encoder subroutine EN- 


CODE, which generates a redundant bit YOn. These five bits are packed into 
a single word by the PACK subroutine. 


Example 7-20. V.32 Encoder Using Accumulator Buffer 
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title ‘Convolutional Encoding for a V.32 Modem’ 
-mmregs 

STATMEM -set 60h (60h — 62h) Delay States S1,82,83 
INPUT -set 64h :(64h — 67h) Four input bits 
YPAST -set 68h :(68h — 69h) Past values of Yl and Y2 
OUTPUT set 63h ;¥0, the redundant bit 
LOCATE set 6ah ;Temporary storage for current input word 
PCKD_IP -set 1000h ;Input buffer (4 bits packed per word) 
PCKD_OP set 2000h ;Output buffer (5 bits packed per word) 
COUNT set 50 ;# of input data words 

text 
INIT LAR AR1,#PCKD_IP 

LAR AR2,#PCKD_OP 

LAR AR3 , #COUNT—1 :COUNT contains # of input words 

LDP #0 
START MAR *, ARI 

LACC *+,0,AR0 

SACL LOCATE ;Temporary storage for current input word 

LAR ARO, #INPUT+3 

LACL #3 ;Loop 4 times 

SAMM BRCR 

LACL #1 

SAMM DBMR sLoad DBMR with the mask for LSB 
UNPACK LACC LOCATE ;Acc = packed input bits 


RPTB LOOP 1—1 sfor I=0,I<=3,I++ 
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LOOP1 


PACK LAR 


LOOP2 


me me MO TO 


DIFF 


e 
c 
e 
e 
e 
a 


DIFF 
ENCODE 


ARO, #INPUT 
#3 


BRCR 
*+ 


LOOP2—1 
e+ 
* ,AR2 


*+,0,AR3 
START 


sSave it 
sMask off all bits except LSB 
sShift right to get next bit 


sCall differential encoder 
sCall convolutional encoder 


;Loop 4 times only 


;Get first bit (MSB) 


;for I=0,I<=2,1++ 
zmake space by left—shifting once 
;Pack next bit by left-—shifting other 


sARP <— AR2 

;Save it in packed form 
;Loop if COUNT is not zero 
;Return 


This subroutine differentially encodes Qln and Q2n (INPUT 
buffer) according to previous output values Yin-1 and 

Y2n—1 (YPAST buffer). The resulting values Yln and Y2n overwrite 
previous Qin and Q2n. 


YPAST 
INPUT 
INPUT+1 
YPAST+1 
INPUT+1 
YPAST+1 
YPAST 
INPUT 


INPUT 
YPAST 


sAcc=Y1ln—1 

7Qln & Yln—1 

7(Qin & Yin—1) xor Q2n 

:(Qln & Yiln—1) xor Q2n xor Y2n—1 


:Save Y2n 


:Qln xor Yin—1 
;Delayed return 
sSave Yln 

ssave Yln—l1 


This subroutine generates a redundant bit YOn by convolutional encoding, 
taking Yln and Y2n as input. Three delay states Sl, S2 and S3 are 


located in STATMEM buffer. 


ENCODE LACC 
SACL 
LACC 
XOR 
SACB 
LACC 
AND 
XORB 
SACL 
LACC 
ANDB 
SACB 
LACC 
XOR 
XOR 
XORB 
SACL 
RETD 

LACC 
SACL 


STATMEM 
OUTPUT 
INPUT+1 
STATMEM+1 


OUTPUT 
INPUT 


STATMEM 
OUTPUT 


INPUT 
INPUT+1 
STATMEM+2 


STATMEM+1 


OUTPUT 
STATMEM+2 


YO <— $1 


sY2 xor S2 
*Save in AccB 


sY¥O & Y1 
;(Y¥O & Y1) xor (Y2 xor S2) 
sSave it in Sl 


;¥O & (Y¥2 xor $2) 


7Y¥1 xor Y2 

;(Y¥1 xor Y2) xor S83 

7((¥1 xor Y2) xor S3) xor (YO & (Y2 xor S2)) 
;Update S2 

;Delayed return 


;Update S3 
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There are many practical applications of adaptive FIR/IIR filtering; one exam- 
ple is in the adapting or updating of coefficients. This can become computa- 
tionally expensive and time-consuming. The MPYA, ZALR, and RPTB instruc- 
tions on the ’C5x can reduce execution time. 


A means of adapting the coefficients on the ’C5x is the least-mean-square al- 
gorithm given by the following equation: 


b, (i+ 1) = b, (i) + 2Be()x(i — k) 


where e (i) = x (i) — y (i) 
and 


yi) = 5b xi-h) 


k=O 


Quantization errors in the updated coefficients can be minimized if the result 
is obtained by rounding rather than truncating. For each coefficient in the filter 
at a given point in time, the factor 2*B*e(i) is a constant. This factor can then 
be computed once and stored in the T register for each of the updates. 


MPYA and ZALR instructions help in reducing the number of instructions in the 
main adaptation loop. Furthermore, the RPTB (repeat block) instruction allows 
the block of instructions to be repeated without any penalty for looping. 


Example 7—21 shows a routine that implements a 128-tap FIR filter and an 
LMS adaptation of its coefficients. The single-access internal RAM of the 
‘C50/C51 can be mapped in both the program and data spaces at the same 
time by setting OVLY and RAM control flags to 1. This feature can be used to 
advantage by locating the coefficients table in single-access internal RAM so 
that it can be accessed by MACD and MPY instructions without modifying 
RAM configuration. Note that the MACD instruction requires one of its oper- 
ands to be in program space. 


If the address of the coefficient table is to be determined in runtime, load the 
BMAR (block move address register) with the address computed dynamically 
and replace the instruction 


MACD COEFFP,*— 


by 
MADD *- 
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Example 7-21. Adaptive FIR Filter Using RPT and RPTB 


-title ‘Adaptive Filter’ 


def ADPFIR 
def X,Y 
-mmregs 
* 
* This 128—-tap adaptive FIR filter uses on-chip memory block BO for 
* coefficients and block Bl for data samples. The newest input should 
* be in memory location X when called. The output will be in memory location Y 
* when returned. 
* OVLY =1 , RAM =1 when this routine is called. 
* 
COEFFP .set 02000h ;Program memory address of the coeff. in S/A RAM 
COEFFD .set 02000h ;Data memory address of the coeff. in S/A RAM 
* For TMS320C51, COEFFD is 0800h instead of 02000h 
ONE -set 7Ah ;Constant one. (DP=0). 
BETA -set 7Bh ;Adaptation constant. (DP=0). 
ERR -set 7Ch ;Signal error. (DP=0). 
ERRF set 7Dh sError function. (DP=0). 
Y -set 7Eh 7Filter output. (DP=0). 
X -set 037Fh ;Newest data sample. 
FRSTAP .set 0380h ;Next newest data sample. 
LASTAP .set 03FFh :;Oldest data sample. 


* 


* Finite impulse response (FIR) filter. 


* 


ADPFIR ZPR 
LACC 
MAR 
LAR 
RPT 
MACD 
APAC 
SACH 
NEG 
LAR 
ADD 
SACH 
DMOV 


FIR 


* 


* LMS Adaption 
* 


#1,14 

* , AR3 
AR3,#LASTAP 
#127 
COEFFP, *— 


;Clear P register. 
;Load output rounding bit. 


;Point to oldest sample. 
3128-tap FIR filter. 


;Store the filter output. 
;Acc = —y(n) 


;Add the newest input sample. 
yerr(n) = x(n) — y(n) 
;Include newest sample 


of Filter Coefficients. 


ERR 
BETA 


ONE, 14 
ERRF, 1 


#126 
BRCR 


AR2,#COEFFD 
AR3, #LASTAP 
ERRF 

*—,AR2 

Acc = ak(i) + 
*+ 


* ,AR3 


e+ 


He err 

2P beta*err(i) 
serrf(i) = beta * err(i) 
:Round the results. 
;Save errf(i) 


3127 coefficients to update 
in the loop. 

:Point to the coefficients. 
;Point to the data samples. 


7P 2*beta*err(i)*x(i-255) 
sFor I=0,I<=126,1I++ 

;Load ACCH with ak(i). 

7P 2*beta*err(i)*x(i-—k—-1) 
2*beta*err(i)*x(i—-k) 

sStore ak(i+1) 


;Finally update last coeff. a0(i) 


;Delayed return 
sAcc = a0(i) + 2*beta*err(i)*x(i) 
;Save a0(i+1) 
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Infinite impulse response (IIR) filters are widely used in digital signal process- 
ing applications. The transfer function of an IIR filter is given by: 


bo + b,z-1 +... + byz-™ _Y(2z) 


bi Rada Er az-i+..+ayz-% — X(z) 


Figure 7—5 shows a block diagram of an Nth order direct-form II IIR filter: 
Figure 7-5. Nth Order Direct-Form Type II IIR Filter 


x(n) 


In the time domain, an Nth order IIR filter is represented by the following two 
difference equations: 


at time interval n: 

x(n) is the current input sample 

y(n) is the output of the IIR filter 

d(n) = x(n) — d(n—1)a, — ... — d(n—N+1)ay_y 

y(n) = d(n)bpg + d(n—1)b, +... + d(nN—N+1)by_y 
The two equations above can easily be implemented on the ’C5x by using mul- 
tiply-accumulate instructions (MAC, MACD, MADS, MADD). Note that the 
second equation would also require a data-move operation to update the state 
variable sequence d(n). Example 7—22 implements an Nth order IIR filter us- 


ing single-instruction repeat (RPT) and multiply-accumulate (MAC, MACD) in- 
structions. 


Example 7-22. Using RPT and MACD 


etitle "Nth Order IIR Type II Filter” 
-mmregs 
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Application-Oriented Operations 


* 
* This routine implements an N—th order type II IIR filter. 
* d(n) = x(n) — d(n—-1l)al — d(n-2)a2 +...— d(n—N+1)aN—1 
* y(n) = d(n)bO + (dn—1)b1l +...+ d(n—N+1)bN-1 
* Memory Requirement: 
* State variables (low to high data memory): 
* d(n) d(n—1) ... d(n—N+1) 
* Coefficient (low to high program memory): 
* b(N-1) b(N-2) ... b(1) —a(N-1) —a(N-2) ... —a(1) —a(0) 
* Entry Conditions: 
* ARO —> Input 
* AR1 —> d(n—N+1) 
* AR2 —> Output 
* COEFFA —> —a(N—1) 
* COEFFB —> b(N—1) 
* ARP = ARO 
* 
IIR_N 

ZPR ;Clear P register 

LACC *,15,AR1 sGet Q15 input 

RPT #(N—2) sfor i=1,i<=N-1,++i 

AC COEFFB,*— ;Acc+=—a(N—i) ) *d(n—N+i) 

APAC :Final accumulation 

SACH *,1 ;Save d(n) 

ADRK N-1 ;AR1 —> d(n—N+1) 

RPTZ #(N—-1) sfor i=1,i<=N,++i 

MACD COEFFA,*— ;Acct+=b(N—i) *d(n—N+i) 

LTA * ,AR2 sFinal accumulation 

SACH * a ;Save Yn 


Due to the recursive nature of an IIR filter, quantization of filter coefficients may 
cause significant variation from the desired frequency response. To avoid this 
problem, the desired filter transfer function can be broken up into lower order 
sections that are cascaded with each other. The following example shows an 
implementation of N cascaded second-order IIR sections (also called biquad 
sections). The filter coefficients and the state variables are stored in data 
memory. Note the use of LTD and MPYA instructions to perform multiply-accu- 
mulate and data-move operations. 
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Example 7-23. Using LTD and MPYA 


etitle "N Cascaded BiQuad IIR Filters” 
-mmregs 


This routine implements N cascaded blocks of biquad IIR 
canonic type II filters. Each biquad requires 3 data 
memory locations d(n),d(n-1),d(n—2), and 5 coefficients 
—al,—a2,b0,b1,b2. 
For each block: d(n) = x(n)—d(n—1)al—d(n—2)a2 

y(n) = d(n)b0+d(n—1)b1+d(n—2)b2 
Coefficients Storage: (low to high data memory) 

—a2,—al,b2,b1,b0, ... ,—a2,—al,b2,b1,b0 
lst biquad Nth biquad 


State Variables: (low to high data memory) 
a(n) ,d(n—1),d(n-—2), eee ,a(n),d(n—1),d(n—2) 
Nth biquad lst biquad 


Entry Conditions: 
AR1 —> d(n-2) of lst biquad 
AR2 —> —a2 of lst biquad 
AR3 —> input sample (Q15 number) 
AR4 —> output sample (Q15 number) 
DP = 0, PM = 0, ARP = 3 


+ + +t © He +e + € HF HF HF HF SF HF HF HF He SF FF 


BIQUAD: ; Setup variables 

ZPR ; Clear P register 
LACC *,15,AR1 ; Get Q15 input 

SPLK #2, INDX ; Setup index register 
SPLK #N—1,BRCR ; Setup count 


; Begin computation; 
RPTB ELOOP—1 ; repeat for N biquads 


LOOP : 
LT *—,AR2 ; T = d(n-2) 
MPYA *+,AR1 ; Acc = x(n), P = —d(n—2)a2 
LTA *—,AR2 ; Acc += —d(n-—2)a2, T = d(n-1) 
MPY *+ ; P = —d(n—1l)al 
LTA *+,AR1 ; Acc += —d(n—l)al, T = b2 
SACH *0+,1 ; Save d(n) 
MPY *— ; P = d(n—2)b2 
LACL #0 ; Acc = 0 
LTD *—,AR2 > T = d(n-1), d(n—2) = d(n—1) 
MPY *+,AR1 ; Acc += d(n—2)b2, P = d(n—1)bl 
LTD *—,AR2 ; T = d(n), d(n—-1) = d(n) 
MPY *+,AR1 s Acc += d(n—-1)bl, P = d(n)b0 
ELOOP: 
LTA *,AR4 : Final accumulation 
SACH *,1 ; Save output in Q15 format 


7.10.4 Dynamic Programming 


Dynamic programming techniques are widely used in optimal search algo- 
rithms. Applications such as speech recognition, telecommunications, and ro- 
botics use dynamic programming algorithms. The ’C5x digital signal proces- 
sors have an enhanced instruction set for efficient implementation of dynamic 
programming methods. 


Most real-time search algorithms use the basic dynamic programming princi- 
ple that the final optimal path from the start state to the goal state always 
passes through an optimal path from the start state to an intermediate state. 
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Identifying intermediate paths reduces a long, time-consuming search to the 
final goal. An integral part of any optimal search scheme based on the dynamic 
programming principle is the backtracking operation. The backtracking is nec- 
essary to retrace the optimal path when the goal state is reached. 


Example 7-24 shows an implementation of the backtracking algorithm in 
which the path history consists of four independent path traces for Ntime peri- 
ods. This path history is stored in a circular buffer. After each back-tracking op- 
eration, the path history is updated by a search algorithm (not shown) for the 
next time period. The path history buffer is shown in Figure 7—6 for N equal to 
4. Each group of four consecutive memory locations in the buffer corresponds 
to the expansion of the four paths by one node (or by one time period). Each 
element of a group corresponds to one of the four states in that time period. 
In addition, each element of a group points to an element in the previous time 
period that belongs to that path. 


As an illustration of backtracking using the path history buffer shown in 
Figure 7—4, the element corresponding to state #0 at the current time period 
contains a 1. This points to the second element of the previous time period that 
contains a 0. In this way, beginning from the current time period and using 
pointers to step back in time, this path is traced back as 1—O—2—1. Note that 
this simplified backtracking approach is taken here to illustrate 'C5x program- 
ming techniques. Most real applications would require more complex back- 
tracking algorithms. 


Figure 7—6. Backtracking With Path History 


State #0 
1 
2 
3 
n—4 n-3 n-2 n—-1 n 
Path Trace for 5 Periods Current Time Period 
Buffer + 15 


n—-1 n n—-3 n-2 
* Current Time Period 
Path History Circular Buffer (N = 4) 
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Example 7-24. Backtracking Algorithm Using Circular Addressing 


* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 


Backtracking Example 

This program back-tracks the optimal path expanded by 
a dynamic programming algorithm. The path history 
consists of four paths expanded N times. It is set up 
as a circular buffer of length N*4. 

Note that decrement type circular buffer is used. 

The start and end address of the circular buffer are 
initialized this way because of two reasons: 

1— to avoid skipping the end—address of circ buffer 
2— to ensure that wrap—around is complete before next 
iteration. 


LAR ARO,#BUFFER ;get buffer address 
LMMR INDX, PATH ;get the selected path [0..3] 
SPLK #N~—1,BRCR strace back N time periods 
init. ARO as pointer to circular buffer#1; length=N*4 words 
SPLK #BUFFER+(N—1)*4,CBSR1 
SPLK #BUFFER—3 , CBER1 
SPLK #08h,CBCR 


RPTB TLOOP—1 :for i=0,i<N, i++ 

MAR *O+ ;offset by state# 

LACC *0— ;get next pointer & reset to state#0 
SAMM INDX ;save next state# 

SBRK 3 ;decrement ARO to avoid skipping CBER1 
SBRK 1 snow ARO is correctly positioned 1 time 


TLOOP: ;period back (circular addressing) 
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7.11 Fast Fourier Transforms 


Fourier transforms are an important tool often used in digital signal processing 
systems. The purpose of the transform is to convert information from the time 
domain to the frequency domain. The inverse Fourier transform converts infor- 
mation back to the time domain from the frequency domain. Computationally 
efficient implementations of the Fourier transforms are known as fast Fourier 
transforms (FFT). 


The ’C5x reduces the execution time of all FFTs by virtue of its 50-ns instruction 
cycle time. Also, the bit-reversed addressing mode helps reduce execution 
time for radix-2 FFTs. As demonstrated in Figure 7—7 and Figure 7-8, the in- 
puts or outputs of an FFT are not in sequential order. This scrambling of data 
locations is a direct result of the radix-2 FFT derivation. Observation of the fig- 
ures and the relationship of the input and output addressing reveal that the ad- 
dress indexing is in bit-reversed order, as shown in Table 7—1. As a result, ei- 
ther the input data sequence or the output data sequence must be scrambled 
in association with the execution of the FFT. In Example 7—27, the input data 
is scrambled before the execution of FFT algorithm so that the output is in or- 
der. | 


Figure 7—7. An In-Place DIT FFT With In-Order Outputs and Bit-Reversed Inputs 


Stage 1 Stage 2 Stage 3 
x(0) x(0) 


Wy DO 
x(4) ——a, x(1) 
W © 
_— “Tew . 
—_ eNO SX “6 
Se 
Wo ] 
x(7) x(7) 


Legend for twiddle factor: W, = Wg W, = Wg W, = Wé W, = W 
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Figure 7-8. An In-Place DIT FFT With In-Order Inputs but Bit-Reversed Outputs 


x(0) 
x(4) 
x(2) 
x(6) 
x(1) 
x(5) 
x(3) 
x(7) 


Legend for twiddle factor: W, = W 


Wo 


Stage 1 Stage 2 Stage 3 
x(0) 


NZ : 
ERE 
x(4) 
QC 


Table 7-1. Bit-Reversal Algorithm for an 8-Point Radix-2 DIT FFT 
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| Index | Bit Pattern Bit-Reversed Pattern | Bit-Reversed Index 


NO Oh @QN =O 


The bit-reversed addressing mode is part of the indirect addressing implem- 
ented with the auxiliary registers and the associated arithmetic unit. In this 
mode, a value (index) contained in INDX is either added to or subtracted from 
the auxiliary register being pointed to by the ARP. However, the carry bit is not 
propagated in the forward direction; instead, it is propagated in the reverse di- 
rection. The result is‘a scrambling in the address access. 


The procedure for generating the bit-reversed address sequence is to load 
INDX with a value corresponding to one-half the length of the FFT and to load 
another auxiliary register—for example, AR1—with the base address of the 
data array. However, implementations of FFTs involve complex arithmetic; as 
a result, two data memory locations (one real and one imaginary) are asso- 
ciated with each data sample. For ease of addressing, the samples are stored 
in workspace memory in pairs with the real part in the even address locations 
and the imaginary part in the odd address locations. This means that the offset 
from the base address for any given sample is twice the sample index. If the 
incoming data is in the following form: 
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XR(0),XR(1),-.-.,XR(7),XI(0),XI(1),.--,X1(7) 
WHERE 

XR — real component of input sample 

XI — imaginary component of input sample 


then it is easily transferred into the data memory and stored in the scrambled 
order: 
XR(0),XI(0),XR(4),X1I(4),XR(2),X1I(2),...XR(7),XI(7) 


by loading INDX register with the size of FFT and by using bit-reversed ad- 
dressing to save each input word. 


The following list shows the contents of auxiliary register AR1 when INDX is 
initialized with a value of 8 and when the data is being transferred by the code 


that follows. 

MSB LSB 
INDX 0000 0000 0000 #100 0FOR 8-POINT FFT 
ARI 0000 00310 0000 00 0 0BASE ADDRESS 

RPT 15 

BLDD #INPUT, *BRO+ 
AR1 0000 0010 0000 00 0 0 XR(0) 
ARl 0000 0010 0000 10 0 0xR(4) 
AR1 0000 0010 0000 01 0 0 XR(2) 
ARI 0000 0010 0000 1410 0 XR(6) 
AR1 0000 0010 0000 001 0 XR(1) 
AR1 0000 0010 0000 101 0 XR(5) 
AR1 0000 0010 0000 011 0 XR(3) 
ARl1 0000 0010 0000 111 0xXR(7) 
ARI 0000 0010 0000 00 0 1 XI(0) 
AR1 0000 0010 0000 100 1XI(4) 
ARI] 0000 0010 0000 010 1XI(2) 
AR1 0000 0010 0000 1410 1XI(6) 
AR1 0000 0010 0000 001 1XI(1) 
AR1 0000 0010 0000 101 1XI(5) 
AR1 0000 0010 0000 011 1XI(3) 
ARI 0000 0010 0000 1411 1XI(7) 


This is shown in the FFT subroutine for 16 input samples. 
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Example 7-25. Macros for 16-Point DIT FFT 


KRHEKEKKKEKKEKEKKEEEEEKKEKEEEEEEKEEEKEEREKEKEEEKEEKKEREKREKKEEEEKEKKKEEKKKEKKKKEKKKKKKKKKKK 


* FILE: cScxrad2.mac —> macro file for radix 2 fft’s based on 320c5x * 
* * 
* COPYRIGHT TEXAS INSTRUMENTS INC. 1990 * 
KRHA KGAA TAKA AATEC EEEKEKEKKEEKEKKKEKKKKKKEKKKKKRKRKRKKRKKEK KEK 
* * 
* MACRO ’COMBO2X’ FOR THE COMPLEX, RADIX-—2 DIT FFT * 
* * 
* ORGANIZATION OF THE INPUT DATA MEMORY: R1,I1,R2,12,R3,13,R4,14 * 
* * 
KHEAKKKKKEKKEKKEKKEKEKEKREEEKEEEKEKEAKEKEEKEKKEKKKKKKEKKKKKEKKKRKKKKKKKKKKKKRK KKK KEK 
* * 
* THE MACRO ‘COMBO2x’ PERFORMS FOLLOWING CALCULATIONS: * 
* * 
* Rl := [(R1+R2)+(R3+R4) ]/4 INPUT OUTPUT * 
* R2  := [(R1-R2)+(I3-14) ]/4 a * 
* R3 := [(R1+R2)—(R3+R4)]/4 ARO = 7 * 
* R44 := [(R1I-R2)—(13-14)]/4 AR1 —> R1,I1l AR1 — > R5,15 * 
* Il := [(I14+12)+(13+14)]/4 AR2 —> R2,12 AR2 — > R6,16 * 
* I2  := [(I1—-I2)—(R3—-R4) ]/4 ARP—> AR3 —> R3,I3 ARP — > AR3 — > R7,I17 * 
* I3) s= [(114+12)—(13+14) }]/4 AR4 —> R4,14 AR4 — > R8,I8 * 
* I4 := [(I1-I2)+(R3—R4)]/4 * 
* * 
* For a 16—point Radix 2 complex FFT the Macro ‘'COMBO2x’ has to be * 
* repeated N/4 times (e.g. 4 times for a 16 point FFT). * 
* * 
KKK KKK EKER EEREKKEKEEKEKEEKEEKEKKKKEKKKKKEEKKKKKEKKEKKK KKK 


* 


COMBO5x S$MACRO num REPEAT MACRO ’COMBO5x’: N/4 times 


me Se 


SPLK #:num:—1,BRCR execute ‘num’ times ‘COMBO5x’ 
* 
RPTB comboend j; ARP AR1 AR2 AR3 AR4 ARS5 
* iiceeees: + aapsieneiaata ~ ataiiemeesie’”eaniciganammens' ~ samasianaatas|-~Sommetinietece 
LACC *,14,AR4 ; ACC := (R3)/4 4 Rl R2 R3 R4 Tl 
SUB *,14,AR5 ; ACC := (R3—R4)/4 5 Rl R2 R3 R4 Tl 
SACH *+,1,AR4 ; Tl = (R3-R4)/2 4 Rl R2 I3 R4 12 
* 
ADD *+,15,AR5 ; ACC := (R3+R4)/4 5 Rl R2 R3 14 12 
SACH *,1,AR2  ; 172 = (R3+R4)/2 2 Rl R2 R3 14 72 
* 
ADD *,14,AR1 ; ACC := (R2+R3+R4)/4 1 Rl R2 R3 14 T2 
ADD *,14 > ACC := (R1+R2+R3+R4)/4 1 Rl R2 R3 T4 T2 
SACH *+,0,AR5 ; Rl := (R1+R2+R3+R4)/4 5 Il R2 R3 14 12 
SUB *,16,AR3  ; ACC := (R1+R2—(R3+R4))/4 3 I1 R2 R3 14 T2 
SACH *+,0,AR5 ; R3. := (R1+R2—(R3+R4))/4 5 Il R2 13 14 12 
* 
ADD *,15,AR2 ; ACC := (R1+R2)/4 2 Il R2 13 I4 T2 
SUB | *,15,AR3  ; ACC := (R1—-R2)/4 3 Il R2 13 I4 T2 
ADD *,14,AR4 ; ACC := ((R1I-R2)+(I3))/4 4 Il R2 13 14 12 
SUB *,14,AR2  ; ACC := ((RI-R2)+(1I3-14))/4 2 I1 R2 I3 14 12 
SACH *+,0,AR4 3; R2 := ((RI-R2)+(13-14))/4 4 Il I2 I3 14 12 
ADD *—,15,AR3 ; ACC := ((R1I—R2)+ I3+I4 )/4 3 I1 12 13 R4 T2 
SUB *,15,AR4 ; ACC := ((RI-R2)—(13-I4))/4 4 Il I2 13 R4 12 
SACH *+,0,ARL ; R4 := ((RI-R2)—(13-14))/4 1 Il 12 13 I4 172 
* 
LACC *,14,AR2  ; ACC := (11)/4 2 Il 12 I3 14 12 
SUB *,14,AR5 ; ACC := (I1-I2)/4 5 Il I2 13 14 72 
SACH *,1,AR2 : T2  s= (I11-12)/2 2 Il 12 13 4 T2 
ADD *,15,AR3 3; ACC := ((I1+1I2))/4 4 I1 12 I3 14 172 
ADD *,14,AR4 ; ACC := ((11+1I2)+(13))/4 4 Il 12 13 4 12 
ADD *,14,AR1 ; ACC := ((11+12)+(13+14))/4 1 I1 I2 13 I4 T2 
SACH *0+,0,AR3 ; Il := ((11+I12)+(I3+I14))/4 3 RS I2 13 14 12 
SUB *,15,AR4 3; ACC := ((11+I12)-(I3+14))/4 4 RS I2 13 14 12 
SUB *,15,AR3. ; ACC := ((I1+I2)—(13+14))/4 3 R5 I2 13 I4 172 
SACH *0+,0,AR5 ; I3\ := ((I1+I2)—(I13+I14))/4 5 R5 I2 R7 I4 172 
* 
LACC *¥—, 15 : ACC := (I1-I2)/4 5 R5 12 R7 14 Til 
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SUB *,15,AR2 ; ACC := ((I1-I2)—(R3—R4))/4 2 R5 I2 R7 14 Ti 

SACH *0+,0,AR5 ; I2 :* ((I1-I2)-(R3—R4))/4 5 RS R6 R7 4 Til 

ADD *,16,AR4 ; ACC := ((I1-I2)+(R3-R4))/4 4 RS R6 R7 I4 Til 
comboend: 

SACH *0+,0,AR3 ; I4 3: ((I1-I12)+(R3—-R4))/4 3 RS R6 R7 RB Tl 
* 

MAR * ,AR2 * ARP=AR2 

SENDM 


* 
HHA KH KKK KKK KKK KKK EKEKEEEKEKKEEEKEKKEKEKKEKEKKEKKKEKEKKKKKKKKKKKKEEK 
* 


* 

* MACRO ’ZEROI’ number of words ;: 10 * 
* * 
* ARP=2 FOR INPUT AND OUTPUT * 
* AR2 —> QR,QI,QRt1,... * 
* AR3 —> PR,PI,PR+1,... * 
* * 
* CALCULATE Re[P+Q] AND Re[P—Q] * 
* QR’=(PR—OR) /2 * 
* PR’=(PR+QR) /2 * 
* PI’=(PI+QI)/2 * 
* PI’=(PI-QI)/2 * 
* * 
KKK KKK AKER KKH KEEEKEKKEKEEKEKEKEKKEKKEKKEKKRKRKKKKRKKEKEK 
ZEROIL SMACRO : AR1 AR2 ARP 

LACC *,15,AR1L ; ACC := (1/2) (QR) PR QR 1 

ADD *,15 ; ACC := (1/2) (PR+QR) PR QR 1 

SACH *+,0,AR2 ; PR := (1/2) (PR+QR) PI OR 2 

SUB *,16 ; ACC := (1/2)(PR+QR)-—(QR) PI QR 2 

SACH *+4 ; QR := (1/2) (PR-QR) PI QI 2 
* 

LACC *,15,AR1 ; ACC := (1/2)(QT) PI QI 1 

ADD *,15 ; ACC := (1/2) (PI+QI) PI QI 1 

SACH *+,0,AR2 ; PI := (1/2) (PI+QT) PR+1 QI 2 

SUB *,16 s ACC := (1/2)(PI+QI)-—(QI) PR+1 QI 2 

SACH *+ ; QI. := (1/2) (PI-QT) PR+1 QR+1 2 

SENDM 
KWH KEKEEKEKE EKER HEE KEHKKREKKAEEEKEEEKEKHEEEKEKEKREKEKKKKEKKKKKRKEKKEKE 
* * 
* MACRO ‘/PBY2I’ number of words: 12 * 
* * 
* PR’=(PR+QI)/2 PI’=(PI—QR) /2 * 
* QR’ =(PR-QI)/2 QI‘=(PI+QR) /2 * 
% * 
KRHKKKKEKKEKEE KEKE EEK KKK KKEKKEKEKRKKKKKEKKEKRKRKKKKKEK 
PBY2I $MACRO ; AR1 AR2 ARP 

LACC *+,15,AR5 ; PR QI 5 

SACH *,1,AR2 : TMP=QR PR QI 2 
* 

LACC *,15,AR1 ; ACC := QI/2 PR QI 1 

ADD *,15 > ACC := (PR+QI)/2 PR QI 1 

SACH *+,0,AR2 ; PR := (PR+QI)/2 PI QI 2 

SUB *—, 16 : ACC := (PR-QI)/2 PI QR 2 

SACH *+,0,ARl ; QR := (PR—-QI)/2 PI QI 1 
* 

LACC *,15,AR5 ; ACC := (PI)/2 PI QI 5 

SUB *,15,AR1 ; ACC := (PI-QR)/2 PI QI 1 

SACH *+,0,AR5 ; PI := (PI-—QR)/2 PR+1 QI 5 

ADD *,16,AR2 ; ACC := (PI+QR)/2 PR+1 QI 2 

SACH e+ : QI := (PI+QR)/2 PR+1 QI+1 2 

SENDM 


* 
KHEKKKKKKEEKEEKEKKEKEKEEEEKEKKEEKEEKKEKEREKEKEKREKEREKEEKKKEEKEKEEKKKKEKKKEKKEKKKKKKKKKKKKKKEE 
* 


MACRO ‘PBY4J’ number of words: 16 


T=SIN(45)=COS(45)=W45 


t+ + + HF F 
+ ¢ ££ $F$ 


PR’= PR + (W*QI + W*QR) = PR + W * QI + W * QR (<— AR1) 
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* QR’= PR — (W*QI + W*QR) = PR — W * QI — W * QR (<— AR2) * 
* PI’= PI + (W*QI — W*QR) = PI + W * QI — W * QR ‘(<— AR1+1) * 
* QI’= PI — (W*QI — W*QR) = PI — W * QI + W * QR (<— AR1+2) * 
* * 
KHHKKEKKEKEKEKEEEKEKEEKEKEEEEKEEEEKKEKEKKEKKEKEKEKKREKEKKRKKKKKKRKKRKKRKEKKKRRKKRKKKRKKRKRRKRKEKREE 
eo 
PBY4J SMACRO ; TREG= W AR5 PREG AR1 AR2 ARP 
MPY *+,AR5 : PREG= W*QR/2 _ W*QR/2 PR QT 5 
SPH * ARI] ¢ TMP = W*QR/2 W*QR/2 W*QR/2 PR QI 1 
LACC *,15,AR2 ; ACC = PR/2 W*QR/2 W*QR/2 PR QOL 2 
MPYS * ; ACC = (PR—W*QR)/2 W*QR/2 W*QI/2 PR QR 2 
SPAC : ACC = (PR—W*QI-—W*QR)/2 W*QR/2 W*QI/2 PR QR 2 
SACH *+,0,AR1 ; QR = (PR—W*QI-—W*QR)/2 W*QR/2 W*QI/2 PR QI 1 
SUB *,16 : ACC = (—PR—W*QI—W*QR)/2 W*QR/2 W*QI/2 PR QI 1 
NEG : ACC = (PR+W*QI+W*QR)/2 W*QR/2 W*QI/2 PR QI 1 
SACH e+ : QR = (PR+W*QI+W*QR)/2 W*QR/2 W*QI/2 PI QI 1 
tf 
LACC *,15,AR5 ; ACC = (PI)/2 W*QR/2 W*QI/2 PI QI 5 
SPAC : ACC = (PI-—W*QI)/2 W*QR/2 _ PI QI 5 
ADD *,16,AR2 ; ACC = (PI—W*QI+W*QR) /2 - _ PI QI 2 
SACH *+,0,AR1 ; QI = (PI—W*QI+W*QR) /2 _ _ PI QR1 1 
SUB *,16 ; ACCU= (—PI—W*QI+W*QR) /2 _ _ PI QR1 1 
NEG ; ACCU= (PI+W*QI-—W*QR) /2 - - PI QR1 1 
SACH *+,0,AR2 ; PI = (PI+W*QI—W*QR)/2 - _ PR1l QR1 2 
SENDM 
* 
HHH KH KKK KEKE AKER KEREEEEEEKEKKEKEKKEEKKKKEEKEEKKKKKKKEKKRE KKK 
* * 
* MACRO ‘P3BY4J’ number of words: 16 * 
* * 
* ENTRANCE IN THE MACRO: ARP=AR2 * 
* AR1—>PR, PI * 
* AR2—>QR,QI * 
* TREG=W=COS (45)=SIN(45) * 
* * 
* PR’= PR + (W*QI — W*QR) = PR + W * QI — W * QR (<— AR1) * 
* QR’= PR — (W*QI — W*QR) = PR — W * QI + W * QR (<— AR2) * 
* PI’= PI — (W*QI + W*QR) = PI — W * QI — W * QR (<— AR1+1) * 
* QI’= PI + (W*QI + W*QR) = PI + W * QI + W * QR (<— AR1+2) * 
* * 
* EXIT OF THE MACRO: ARP=AR2 * 
* AR1—>PR+1,PI+1 * 
* AR2—>QR+1,QI+1 * 
* * 
KEKE KKK KKK KKEKEEEKKEKEKKEKEKRKEKEKEKKKRKKKKRKEKKKKRKKK 
P3BY4J SMACRO ; TREG= W AR5 PREG AR1 AR2 ARP 
MPY *+,AR5 * PREG= W*QR/2 _ W*QR/2 PR QI 5 
SPH *, ARI] : TMP = W*QR/2 W*QR/2 W*QR/2 PR QI 1 
LACC *,15,AR2 s ACC = PR/2 W*QR/2 W*QR/2 PR QI 2 
MPYA *— ; ACC = (PR+W*QR)/2 W*QR/2 W*QI/2 PR QR 2 
SPAC : ACC = (PR—-W*QI+W*QR)/2 W*QR/2 W*QI/2 PR QR 2 
SACH *+,0,AR1 ; QR’ = (PR—W*QI+W*QR)/2 W*QR/2 W*QI/2 PR QI 1 
SUB *,16 3 ACC = (—PR—-W*QI+W*QR)/2 W*QR/2 W*QI/2 PR QT 1 
NEG ; ACC = (PR+W*QI—W*QR)/2 W*QR/2 W*QI/2 PR QI 1 
SACH w+ : PR’ = (PR+W*QI—W*QR)/2 W*QR/2 W*QI/2 PI QI 1 
a 
LACC *,15,AR5 : ACC = (PI)/2 W*QR/2 W*QI/2 PI QI 5 
APAC : ACC = (PI+W*QI)/2 W*QR/2 - PI QI 5 
ADD *,16,AR2 : ACC = (PI+W*QI+W*QR) /2 ~ _ PI QI 2 
SACH *0+,0,AR1 ; QI’ = (PI+W*QI+W*QR) /2 _ - PI QR5 1 
SUB *,16 ; ACCU= (—PI+W*QI+W*QR) /2 _ - PI QR5 1 
NEG : ACCU= (PI—W*QI—W*QR)/2 _ _ PI QR5 1 
SACH *0+,0,AR2 ; PI’ = (PI—W*QI—W*QR)/2 _- - PR5 QR5 2 
SENDM 


‘ 
tERRRAERELE EEE SESS EERE EERE ERE S EEE RE RE EERE RE RE RES E ESSERE EES E SERRE RESET ES SEES 


* 
* 
* 


MACRO ‘stage3’ 


number of words: 
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HEHEHE EREEEEEEEKKEKEEEKEEEKKKRKKKKEKEKKKKKKKKKKRKRKKKKKKKKKKKKKKKKKEKS 


stage3 


stage3e: 


* 


Smacro num 
SPLK #:num:—1,BRCR ; execute ’num’—1 times ‘stage3’ 
LT cos45 

RPTB stage3e 


ZEROI 
PBY4J 
PBY2I 
P3BY43 


-set $—1 


SENDM 


KHER EKEEEKEEEKEEKEKERKKEEKKKKEKKKKKKKKKKEKEKKKKKKRKKKKKKR KK KKK 


* 


+ ee € e+ + eH HF HF He SF SF HF FF 


* 


REKEKRKCEKEREEKKEEKKEEKEEKEEKEEKEEKREKEKEEKEEEKKEEKKEKEKEEKEKKKKEKKKEKRKKKKEKKKKKKKKRKKKRKRKRKK 


MACRO: 


THE MACRO 


Definition: ARP 


Definition: AR] 
Definition: AR2 
Definition: AR3 
Definition: AR4 
Definition: AR5 


PR’ = (PR+(QR*WR+QI*WI) )/2 


‘BUTTFLYI’ 


‘BUTTFLYI’ 


general butterfly radix 2 for 320C5x 


REQUIRES 18 WORDS 


—> AR2 (input) ARP —> AR2 


—> QR (input) AR1 —> QR+1 
—> PR (input) AR2 —> PR+1l 
—> Cxxx (input) AR3 —> Cxxx+l 
—> Sxxx (input) AR4 —> Sxxx+l 
—> temporary variable (unchanged) 


uses index register 


PI’ = (PI+(QI*WR-QR*WI) )/2 


QR’ 
QT’ 


BUTTFLYI $MACRO 


me OE 


RPTB btflyend 
LT *+,AR3 
MPY * , AR2 
LTP *—, AR4 
MPY *, AR3 
MPYA *+,AR2 
t 
LT *,AR5 
SACH *,1,AR1 
a 
ADD *,15 
SACH *+,0,AR5 
SUB *,16,AR2 
SACH *+,0,AR1 
0 
LACC *,15,AR4 
MPYS *+,AR2 
‘ 
APAC 
SACH *+,0,AR1 
NEG 
ADD *,16 
btflyend: 
SACH *+,0,AR2 
SENDM 


s end of file 


(PR—(QR*WR+QI*WI) ) /2 
(PI-—(QI*WR-QR*WI) ) /2 


WR=COS (W) 


(output) 


(output) 
(output) 
(output) 
(output) 


—> WR=cosine 
——-> WlI=sine 


WI=SIN(W) 


* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 


(contents of register after exec.) 


TREG 
:TREG:= QR QR 
;PREG:= QR*WR/2 QR 
sACC := QR*WR/2 QI 
;PREG:= QI*WI/2 QI 
sACC := (QR*WR+QI*WI) /2 QR 
PREG:= QI*WR 

;TREG = QR QR 
7HO := (QR*WR+QI*WI) QR 
sACC := (PR+(QR*WR+QI*WI))/2 QR 
sPR := (PR+(QR*WR+QI*WI))/2 OR 
sACC := (PR-(QR*WR+QI*W1I))/2 QR 
QR := (PR-(QR*WR+QI*WI))/2 OR 
;ACC := PI /PREG=QI*WR QI 
;PREG:= QR*WI/2 QI 
;ACC := (PI-QI*WR) /2 

sACC := (PI-(QI*WR-QR*WI))/2 QI 
QI := (PI-(QI*WR-QR*WI))/2 QI 
sACC :=(—PI+(QI*WR-OR*WI))/2 QI 
sACC := (PI+(QI*WR-OR*W1I))/2 QI 
PI := (PI+(QI*WR-QR*W1))/2 QI 


AR1 AR2 AR3 AR4 ARP 


PR QI 
PR QI 
PR QR 
PR QR 
PR QR 
PR QR 
PR QR 
PR QR 
PI QR 
PI QR 
PI QI 
PI QI 
PI QI 
PI QI 
PI QR+1 
PI QR+1 
PI QR+1 
PR+1 QR+1 
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Example 7-26. Initialization Routine 


* 
* file: INIT-—FFT.ASM 
* 
* Initialized variables 
* 
-bss NN,1 snumber of fft—points 
-bss NN2,1 °2*N—1 
-bss DATAADD, 1 sSTART ADDRESS OF DATA 
-bss cos45,1 
-bss sin4,1 sstart of sine in stage 4 
-bss cos4,1 ;start of cosine in stage 4 


* Temp variables 
* 


-bss TEMP , 2 sused for temporary numbers 
* 
-sect "vectors” 
B INIT, *,ARO 
-sect "init” 
-word 5A82h :cos(45)=sin(45) 
word TWID,TWID+4 
TABEND: -set $ 
* 
INIT: LDP #0 ;use only B2 and mmregs for direct addressing 
SPM 0 sno shift from PREG to ALU 
CLRC OVM :disable overflowmode 
SETC SXM ;enable sign extension mode 


SPLK #pmstmask,PMST :ndx=trm=1 


* INIT Block B2 
* 
LAR ARO, #NN sarp is already pointing to ar0 
LACC #TABINIT 
RPT #TABEND—TABINIT 
TBLR + 


+ 


INIT TWIDDLE FACTORS 


LAR ARO, #TWID sarp is already pointing to ar0 
LACC #TWIDSTRT 

RPT #TWIDLEN 

TBLR *+ 


* EXECUTE THE FFT 


LAR AR5 , #TEMP spointer to 2 temp register 
CALL FFT, *,AR3 ;ARP=AR3 FOR MACRO COMBO 

* 

WAIT RET ;Return 
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Example 7-27. 16-Point Radix-2 Complex FFT 


-file "c5cx0016.asm” 

-title "0016 point DIT Radix—2, Complex FFT” 

-width 120 
N -set 16 ; NUMBER OF POINTS FOR FFT 

-mmregs 
pmstmask .set 0110b ; ndx=trm=1 
HHKKKHKKRAKKAE AHA KEKE KEKE KEKE KEKEEKEEKEEKEEKKEREEKERKKEKRKKEKKKEKEKKKKKKEKKEK 
* * 
* 16 — POINT COMPLEX, RADIX~2 DIF FFT WITH THE TMS320C5x / LOOPED CODE * 


* Pe Oa ene PE RU Nae ne RRR Pele ear eC AE SE el 
TECTCTCCTCOCTOCSSSOCCOCT OCCT CSCC CSS SSS EPP PPP ee ee eee eee eee es 
* THE PROGRAM IS BASED ON THE BOOK ‘CIGITAL SIGNAL PROCESSING APPLICATIONS’ * 
* FROM TEXAS INSTRUMENTS P. 69. IT IS OPTIMIZED FOR THE TMS320C5x INCLUDING * 


* BIT REVERSAL ADDRESSING MODE. * 
* * 
Hee Fe He He He HH HHI HH KIKI RII RII KI IK KIKI KT IIT KIKI IKE IKKE KEKE IKKE KEKE EERE KKK 
* * 


USED REGISTERS: INDX,AR1,AR2,AR3,AR4,AR5,ACCU,PREG,TREGO, PMST, BRCR 
2 Stacklevel, Block B2 for temp variables 


PROGRAM MEMORY: 164 WORDS (‘’END’ — ‘FFT’) WITHOUT INITIALIZATION 


COEFFICIENTS : 16 BITS (Q15 Format) SCALING: 1/2*4 


PROGRAM SEQUENCE: 0. INITIALIZATION FOR FFT/COEFF ADD: 240H — 20BH 
1. INPUT NEW DATA INTO ‘INPUT’ ADD: 220H — 23FH 
2. CALL SUBROUTINE FFT ADD: 600H — 6A3H 
2.1. BITREVERSAL FROM INPUT TO DATA ADD: 200H — 21FH 
2.2. FFT WITH WORK SPACE DATA ADD: 200H — 21FH 
3. OUTPUT THE RESULTS FROM DATA ADD: 200H — 21FH 


INPUT DATA AT ADDRESS 0220h—023fh: 


THE DATA IS STORED IN ‘INPUT’ AS THE SEQUENCE: X(0),X(1),---,X(15) 
Y¥(0),¥(1),---,¥(15) 


OUTPUT DATA AT ADDRESS 0200h—021fh: 


THE DATA IS STORED IN ‘DATA’ AS THE SEQUENCE: 
X(0),¥(0),X(1),¥(1),.056 «oe ,X(15),¥(15) 


HEKEKKKKEKREKEEKKEKEEKEKHKEEEKEKRKEKERKKEREEKEEKEKKEEKEKREKREKEKEEEKEKEKEKEKKKRKEKKKEKEKKE 


THIS PROGRAM INCLUDES FOLLOWING FILE: 


THE FILE ‘TWIDDLES.Q15’ CONSISTS OF TWIDDLE FACTORS IN Q15 FORMAT 
THE FILE ‘'C5CXRAD2.MAC’ macro files 


THE FILE ‘INIT-FFT.ASM’ for initialization 


* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
* 
KEKKKKKKEKEKKEKEEEKEKEKEKEEKEREEEEKEKEKKEREEEEEKKKEKEEKKEKKKKKEKKKKKKKKRKKKEKKKKKEKKKKK 


e*eeete te ee ee + + + ee HH HF HH eH HH HE HH HE HF HH FF 


- include C5CXRAD2.MAC 


def TWIDLEN, FFTLEN, TEMP, WAIT,cos45 

def INIT, FFT, TWIDSTRT, TWIDEND 

.def STAGE1,STAGE3 , STAGE4 , INPUT, DATA, TWID 
? 

-sect "twiddles” 


s table of twiddle factors for the FFT 
TWIDSTRT set $ 
- include twiddles.q15 


TWIDEND -set $ 

TWIDLEN -set TWIDEND—TWIDSTRT 

* 

INPUT -usect “input” ,N*2 sinput data array 

DATA ~usect "data”,N*2 ;working data array 

TWID -usect "twid” ,N*2 sreserve space for twiddles 
* 

* - include init—fft.asm 


* 
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-sect "fftprogram” 
* 


* FFT CODE WITH BIT—REVERSED INPUT SAMPLES / ARP=AR3 
* 


FFT: LAR AR3 , DATAADD ;TRANSFER 32 WORDS FROM ‘input’ to ‘data’ 
LACC NN 
SAMM INDX ;indexregister=7 
RPT NN2 ;N TIMES 
BLDD #INPUT, *BRO+ 

* 

* FFT CODE for STAGES 1 and 2 

* 

STAGE1: SPLK #7 , INDX sindexregister = 7 
LAR AR1,DATAADD ;pointer to DATA rl1,il 
LAR AR2 , #DATA+2 ;pointer to DATA + 2 r2,i2 
LAR AR3 , #DATA+4 spointer to DATA + 4 r3,i3 
LAR AR4,#DATA+6 ;pointer to DATA + 6 r4,i4 
COMBO5X 4 srepeat 4 times 


* 


* FFT CODE FOR STAGE 3 / ARP=AR2 
* 


STAGE3: SPLK #9 , INDX sindex register = 9 
LAR AR1,DATAADD sarl —> DATA 
LAR AR2,#DATA+8 sar2 —> DATA+8 
stage3 2 ;repeat 2 times 
* 
* FFT CODE FOR STAGE 4 / ARP=ARP 
* 
STAGE4: SPLK #1, INDX s;index register = 1 
LAR AR1,DATAADD 
LAR AR2 , #DATA+16 
LAR AR3,cos4 sstart of cosine in stage 4 
LAR AR4,sin4 :start of sine in stage 4 
SPLK #6,BRCR 
ZEROI s;execute ZEROI 
BUTTFLYI sexecute 7 times BUTTFLYI 
RET 
END: .set $ 
FFTLEN -set END—FFT+1 
-end 
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_Electrical Specifications 


This appendix contains data sheet information on the TMS320C5x digital sig- 
nal processors family, including the following devices: 


[) TMS320C50 
[) TMS320051 
[) TMS320C53 


Figure A—1 shows the pinout of the ’C5x devices in a 132-pin quad flat pack; 
the pin assignments are given in Table A-1. This appendix also contains the 
electrical characteristics of the 'C5x devices and the mechanical data of the 
132-pin quad flat pack. 


Topic Page 


Pinout and Signal Descriptions 


A.1 Pinout and Signal Descriptions 


Figure A—1. TMS320C5x Pinout 


132-Pin 
Quad Flat Pack Packaget 
(Top View) 


51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 76 78 80 81 82 8&8 


SS ES2Ez eS ao B2SSassesSee233s az & |S 
z = 33 * <i bs “28 BP Bo 2G ZEEE e222 ask 
> Fg < > 
) 


t See Pin Assignments, Table A—1 (page A-3) for location and description of all pins. The ‘C50, ’C51, and 
‘C53 are packaged in 132-pin plastic QFP in production. See Figure A—20 for mechanical data. 
Note: NC = No connect. (These pins are reserved.) 
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Pinout and Signal Descriptions 


Table A—1. TMS320C5x Pin Assignments 


Type Description 


O/Z Instruction Acquisition | 
JTAG Test Reset 


” 
” 


if 


SS 


fe 
N 


ok 


“J ee 
‘o) 


D11 /O/Z 


pis (ss) | voz 


11 D10 

2 | o | wz | 

rs | oe SC«TC 
r 00 Supply 

NCI 

NCI 

NO! 

NC 

Supply Ground | - 

Supply 


Parallel Data Port, Low-Byte (8 pins) 


fe 
N 


as 


—_ 
Olroriniao 


wok, 


Pe 
14] 


< 
” 
+ | 


_ 
~“ 
+ 


NC 
VO/z 
yo/z 
yO/z 

.  yo/z 
Vo/z 

 Yo/z 
yO/z 
vO/z 


OD 


NO nN 


Do (LSB) 
M 


1 


o) 


< 
) 
o 


< a 
es) 
oO 


TCK 


t NC = No connect 
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Table A~1. TMS320C5x Pin Assignments (Continued) 


Interrupt #3 © 

Interrupt #4 

Nonmaskable Interrupt 
Serial Port 1 Data Receive 
Serial Port 2 Data Receive 


Parallel Port Address Bus 
(10 pins) 


t NC = No connect 
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Description 
Supply | Ground 


3 

@® 
” 
Cc 
A?) 
Ss 
< 


PS) 
z 
Q 


Reserved 


Clock Mode Pin 1 


Parallel Port Address Bus 
(6 pins) 


awh, 


N 


“N 
— 
B|z]o 
~Iio 
le 
N 


s]~ 
a] s 
= 
w |r 
= 
O 
N 


N 


Q 


Ni 

“Ni 
z|z|>|>|> 
pad Oo; > 

re) 

N 


a 
[J resoved SOS 
Read Enable 
Write Enable 
pd Reseed 


“Nl 
© 


<j|Ziz 
~i + 


“i 
o) 
S 
819 
c 
A) 
2. 
< 


< 
s) 
0 

c 
oS 
oS 
< 


wey | Ground OSS 


(07 
(Oz 
V07 
V0 Sun 


© | © 
N“ 
oe 
” 
Cc 
ne] 
oS 
< 


8 


CLKIN2 
X2/CLKIN 


Q 


coxa 


TDO O/Z JTAG Scan Output 
Tt NC = No connect 


A-5 


—h 
“N 


Hold Acknowledge | 


Machine Clock Output 


anh, 
—_ 
© 


—h, 
_ 
ie) 


—_ 
om. 
eels 


—_—t § ot 
re are 
& | 6 


— 
— 
o 


Ss 
onb, 
“Ni 


=i 
— 
foe) 


VO/Z | Emulator Interrupt 1 
Supply Ground 


oh, 


oh, 
Nn 
N 


oni, 
N 
© 


t NC = No connect 
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A.2 Electrical Characteristics and Operating Conditions 


Table A—2.Absolute Maximum Ratings Over Specified Temperature Range (Unless 


Otherwise Noted)t 
Supply voltage range, Vop Fo... ccc cc ce cece nent e eee e eee te eeeeeees —-0.3Vto7V 
INDUT VONADCTANGG sce cs eA cia ce (ws ee awe eet okie ett bean toe Ses O0.3Vto7V 
Output voltage ange: cs 6024 tadasoeeesesa tedious Staak oie es aenteee es -O3Vto7V 
Operating case temperature range ......... ccc cece cece t eee tenn ene eenenes 0° to 85°C 
Storage temperature range ....... ccc cee eee eee eet e een eee e eens —55° to 150°C 


t Stresses beyond those listed under “Absolute Maximum Ratings” may cause damage to the device. This is a stress rating only, 
and functional operation of the device at these or any other conditions beyond those indicated in the “Recommended Operating 
Conditions” sections of this specification is not implied. Exposure to absolute-maximum-rated conditions for extended periods 
may affect reliability. 

+ All voltage values are with respect to Vs. 


Table A-3. Recommended Operating Conditions 


(a 
Vss Supply voltage 


CLKX,CLKR, TCLKX, TCLKR | 2.5 Vpp+0.3 


All others 2.0 Vpp+0.3 
Vit Low-level input voltage 0.3 0.8 V 
lo High-level output current -300T | pA 


T Operating case temperature | °C | 


t This Io4 may be exceeded when using a 1-kQ pull-down resistor on the TDM serial port TADD output, however, this output still 
meets Vox specifications under these conditions. 


ait 
vy 
iL aA I 
a 
m 


V 
V 
V 
A 
°C 


Table A—4. Electrical Characteristics Over Specified Free-Air Temperature Range (Unless 
Otherwise Noted) 


Three-state current 


(Vpp = Max) All other three-state | -—20 t 20 


Input current TRST pin (with internal pulidown) | -—10 ¢ 800 |} pA 
(Vi=Vss to Vpp) TMS, TCK, TDI pins (with internal pullups) —400 t 10 


| X2/CLKIN pin —50 + +50] pA 
All other input-only pins 10 | 


Iz 


t 10 
lopc Supply current, core CPU Operating Ta=25°C, Vpp=5.25 V, f,=40.96 MHz 


lpbpp § Supply current, pins Operating Ta=25°C, Vpp=5.25 V, f,=40.96 MHz 


Ino Supply current, standby —_| IDLE2, clocks shut off 


[c\___Input capacitance Oe 
Cy Output capacitance ee 
t All typical nominal values are at Vpp=5 V, Ta=25°C. 

+ These values are not specified, pending detailed characterization. 


§ All input and output voltage levels are TTL-compatible. Figure A-2 shows the test load circuit and Figure A-3 shows the voltage 
reference levels. 


Figure A—2. Test Load Circuit 


Tester Pin 
Electronics 


Output 
Under 
Test 


VLOAD 


Where: lo, = 2.0 mA (all outputs) 
loH = 300 pA (all outputs) 
Vioap = 1.5V 
Cr = 80 pF typical load circuit capacitance. 


TTL output levels are driven to a minimum logic-high level of 2.4 volts and to 
a maximum logic-low level of 0.6 volt. Figure A-3 shows the TTL-level outputs. 
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Figure A—3. TTL-Level Outputs 


TTL-output transition times are specified as follows: 


[J For a high-to-low transition, the level at which the output is said to be no 
longer high is 2.0 volts, and the level at which the output is said to be low 
is 1.0 volt. 


J) For a low-to-high transition, the level at which the output is said to be no 
longer low is 1.0 volt, and the level at which the output is said to be high 
is 2.0 volts. 


Figure A—4 shows the TTL-level inputs. 


Figure A-4. TTL-Level Inputs 


TTL-compatible input transition times are specified as follows: 


.j Fora high-to-low transition on an input signal, the level at which the input 
is said to be no longer high is 2.0 volts, and the level at which the input is 
said to be low is 0.8 volt. 


[] Fora /low-to-high transition on an input signal, the level at which the input 
is said to be no longer low is 0.8 volt, and the level at which the input is said 
to be high is 2.0 volts. 
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A.3 Clock Characteristics and Timing 


The ’C5x can use either its internal oscillator or an external frequency source 
for a clock. The clock mode is determined by the CLKMD1 (pin 71) and 
CLKMD2 (pin 103) clock mode pins. The following table outlines the selection 
of the clock mode by these pins. 


CLKMD1 | CLKMD2 | Clock Source 
ee ae External divide-by-one clock option. | 
i ae Reserved for test purposes. 


{ External divide-by-two option or internal divide-by-two clock option 


with an external crystal. 


ee External divide-by-two option with the internal oscillator disabled. 


A.3.1_ Internal Divide-by-Two Clock Option With External Crystal 


The internal oscillator is enabled by connecting a crystal across X1 and 
X2/CLKIN. The frequency of CLKOUT1 is one-half the crystal’s oscillating fre- 
quency. The crystal should be in either fundamental or overtone operation and 
parallel resonant, with an effective series resistance of 30 ohms and a power 
dissipation of 1 mW; it should be specified at aload capacitance of 20 pF. Note 
that overtone crystals require an additional tuned-LC circuit. Figure A-4 shows 
an external crystal (fundamental frequency) connected to the on-chip oscilla- 
tor. 


Table A-5. Recommended Operating Conditions 


Parameter 


TMS320C5x-40 
f Input clock frequency T™MS320C5x-57! 


t This device utilizes a fully static design and therefore can operate with tc(cl) approaching ». The device is characterized at fre- 
quencies approaching 0 Hz but is tested at a minimum of 3.3 MHz to meet device test time requirements. 

* Other timings for the 57-MHz CLKIN devices are the same as those for the 40-MHz CLKIN devices, except where otherwise 
indicated. 
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Figure A—5. Internal Clock Option 


A.3.2. External Divide-by-Two Clock Option 


An external frequency source can be used by injecting the frequency directly 
into X2/CLKIN, with X1 left unconnected, CLKMD1 set high, and CLKMD2 set 
high. This external frequency is divided by two to generate the internal ma- 
chine cycle. 


The external frequency injected must conform to specifications listed in the 
timing requirements table. 


Table A—6. Switching Characteristics Over Recommended Operating Conditions 
(H = 0.5 &co)) 


Parameter Min Typ 


eee TMS320C5x-40 48.8 2te(c) t 
pene ree TMS320C5x-57! 35 alg 


tc(CO) 


t This device utilizes a fully static design and therefore can operate with tc(cl) approaching ~. The device is characterized at fre- 
quencies approaching 0 Hz but is tested at a minimum of 3.35 MHz to meet device test time requirements. 

+ Other timings for the 57-MHz CLKIN devices are the same as those for the 40-MHz CLKIN devices, except where otherwise 
indicated. 
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Table A—7. Timing Requirements Over Recommended Operating Conditions 
(H = 0.5 tycoy) 


CLKIN high pulse durati TMS320C5x-40 
iabceneeiee at TMS320C5x-57+ 


t Values derived from characterization data and not tested. 

+ Other timings for the 57-MHz CLKIN devices are the same as those for the 40-MHz CLKIN devices, except where otherwise 
indicated. 

§ This device utilizes a fully static design and therefore can operate with tect) approaching ». The device is characterized at 
frequencies approaching 0 Hz, but is tested at a minimum of 6.7 MHz to meet device test time requirements. 


tw(CIH) 


Figure A—6. External Divide-by-Two Clock Timing 
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A.3.3 External Divide-by-One Clock Option 


An external frequency source can be used by injecting the frequency directly 
into CLKIN2, with X1 left unconnected and X2 connected to Vpp. This external 
frequency is divided by one to generate the internal machine cycle. The divide- 
by-one option is used when the CLKMD1 pin is strapped high and CLKMD2 
is strapped low. 


The external frequency injected must conform to specifications listed in the 
timing requirements table. 
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Table A—8. Switching Characteristics Over Recommended Operating Conditions 
(H = 0.5 & (coy) 
Parameter Typ 


iG 
a 
coy CLKOUTIrisotime SS SSC*” 


Transitory phase—PLL synchro- 
b nized after CLKIN2 supplied 


t Values derived from characterization data and not tested. 
+ Other timings for the 57-MHz CLKIN devices are the same as those for the 40-MHz CLKIN devices, except where otherwise 


indicated. 
§ Clocks can be stopped only while the device executes IDLE2 when using the external divide-by-one clock option. 
1 Values guaranteed by design and not tested. 


Table A-9. Timing Requirements Over Recommended Operating Conditions 
(H = 0.5 tco)) 


Unit 
TMS320C5x-40 
tech CLKIN2 cycle time | | ons 
[twsszocses! [35 ns 
cee ee ee 
ey Nee ine ee ee 
; TMS320C5x-40 15 60 | ns | 
twi(CIL) CLKIN2 low pulse duration TuSaOCETaTt 
TMs320C5x-40 | 15 ts 
LKIN2 high 7 |_TMS320C5x-40 | | 
twiciHy | CLKIN2 high pulse duration Saaasqneaay: 


t Values derived from characterization data and not tested. 

+ Other timings for the 57-MHz CLKIN devices are the same as those for the 40-MHz CLKIN devices, except where indicated 
otherwise. 

§ Clocks can be stopped only while the device executes IDLE2 when using the external divide-by-one clock option. Note that to 
(the transitory phase) will occur when restarting clock from IDLE2 in this mode. 


Figure A—7. External Divide-by-One Clock Timing 
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A.3.4 Memory and Parallel I/O Interface Read Timing 


Table A-10. Switching Characteristics Over Recommended Operating Conditions 
(H = 0.5tco) 


[Parameter SSOSCSCS~SCSCSCSMN SSC ~SCsdit 
Teun Setup ime, address valbeforeRDlow? «| H-10)SSSCSC~iSCSCi 
Taw __Holdtime, address valid ater AO igh? S| SSCSCSC~dSCi 
wey FD low pulse duration SSSCSdT SH SSCS~*~‘ SY 
tar Delay time, AD high to WE low p2H-5 i“ ™~—“‘LSCss S| 
t A15-A0,PS, DS, IS, and BR timings are all included in timings referenced as address. 

+ STRB and RD rising and falling edges track and are 0O—4 and + 2 ns, respectively, from CLKOUT1 edges on reads, following the 
cycle after reset, which is always 7 wait states; thus, tolerance of resulting pulsewidths is + 2 ns, not + 4 ns. See Appendix B. 


# Values derived from characterization data and are not tested. 
1 See Figure A-9 for address bus timing variation with load capacitance. 


Table A—11. Timing Requirements Over Recommended Operating Conditions 
(H = 0.5f(co)) 


Parameter | 


TMS320C5x-40_ | 
Read data access from address valid 


| TMS320C5x-57+ 


taa) 


tsu(D)R Read data setup time before RD high 
thip)R Read data hold time after RD high 
tair) Read data access time after RD low 


t See Figure A-9 for address bus timing variation with load capacitance. 
+ Other timings for 57-MHz CLKIN devices are the same as for the 40-MHz devices, except where indicated otherwise. 


A.3.5 Memory and Parallel I/O Interface Write Timing 


Table A-12. Switching Characteristics Over Recommended Operating Conditions 
(H = 0.5t(co)) 


Parameter 


Setup time, address valid before WE low t 

Hold time, address valid after WE high t 

WE low pulse duration #4 | 

WE high pulse duration ** 

Delay time, WE high to RD low 
tsu Bi Setup time, write data valid before WE high # 
Hold time, write data valid after WE high ¢ 
ten(D Enable time, WE to data bus driven _ 


t A15—A0,PS, DS, IS, R/W, and BR timings are all included in timings referenced as address. 


+ STRB and WE edges are 0-4 ns from CLKOUT1 edges on writes. Rising and falling edges of these signals track each other; 
tolerance of resulting ucanaelel is + 2 ns, not + 4ns. See Appendix B for logical device interface timings. 
1 Values derived from characterization data and are not tested. 


§ This value holds true for zero or one wait state only. 
# See Figure A-9 for address bus timing variation with load capacitance. 
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Figure A-8. Memory and Parallel |/O Interface Read and Write Timing 


Note: All timings are for 0 wait states. However, external writes always require two cycles to prevent external bus conflicts. 
The above diagram illustrates a one-cycle read and a two-cycle write and is not drawn to scale. All external writes 


| | oe tsu(A)w ee thiayw 
| 


| | | 
RW a aia) — \ | | / 
t 
ta(r) <> eh | 


immediately preceded by an external read or immediately followed by an external read require three machine cycles. 


Figure A—9. Address Bus Timing Variation With Load Capacitance 
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A.3.6 Ready Timing for Externally Generated Wait States 


Table A-13. Timing Requirements Over Recommended Operating Conditions 


tsu(R-CO READY setup time before CLKOUT1 rises 
thico-R READY hold time after CLKOUT1 rises 


tsu(R)R READY setup time before RD falls 
thiRR READY hold time after RD falis 
twRyw READY valid after WE falls 


thmiw READY hold after WE falls 


Note: The external READY input is sampled only after the internal software wait states are completed. 


Figure A-10. Ready Timing for Externally Generated Wait States During an External 


Read Cycle 
CLKOUT1 | | | | | | | 
| | | 
teun_coy > ke | pl ke tunco | 


| | 
| | | 


| | — + thico-A) 


OT a : Fs eee: tammaian | 
READY | | | 
| | Wait state 


| Generated 
tsu(RR PI 2 | Wait state | by READY | 


Generated 
| internally 
RD | | thiyR 


NO { 
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Figure A-11. Ready Timing for Externally Generated Wait States During an External Write 
Cycle 


CLKOUT1 | | | | | | | | 


k¢—>}—| thir-co) | 
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tsuR-co) ke | | | 
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| | 

| 
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| 
WRW le 
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| 
| | 
| 
l4— thaw 
WE | i | 


| | 
Wait State Generated by READY See 


A.3.7 Reset, Interrupt, and BIO Timings 


Table A-14. Timing Requirements Over Recommended Operating Conditions 
(H = 0.5t(co)) 


Parmer Ma [UR 

Reuiny __WTT-INTA, NWI, RS setup timo before CLKOUTIIw? ——s«dY Sid 

Rng) INTT-INT4, NWI, FS hold tne afer CLKOUTIIow? ———Ss~dCSOSC~dCSs 

we NTT-INTA, NMTTow pulse duration, synchronous SY 

Reugy FS set uptime before X2/0LKINow ——————SSSCSCS—~dSCTO 
5 


twiBl)s BIO low pulse duration, synchronous a aa 
twiBia BIO low pulse duration, asynchronous * 
tsu(BI BIO setup before CLKOUT1 low 


BIO hold time after CLKOUT1 low 


thiBl 


t These parameters must be met to use the synchronous timings. Both reset and the interrupts can operate asynchronously. The 
pulse widths require an extra half-cycle to guarantee internal synchronization. 

+ If in IDLE2, add 4H to these timings. 

# Values derived from characterization data and are not tested. 
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Figure A-12. Reset, Interrupt, and BIO Timings 
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A.3.8 Instruction Acquisition ([AQ), Interrupt Acknowledge ([ACK), External 
Flag (XF), and TOUT Timings 


Table A-15. Switching Characteristics Over Recommended Operating Conditions 
(H = 0.5t(co)) 


w(TOU TOUT pulse width 2H — 12 | 


ty Delay time, XF valid after CLKOUT1 | 


t TAQ goes low during an instruction acquisition. It goes low only on the first cycle of the read when wait states are used. The 
falling edge should be used to latch the valid address. The AVIS bit in the PMST register must be set to zero for the address 
to be valid when the instruction being addressed resides in on-chip memory. 

+ TACK goes low during the fetch of the first word of the interrupt vector. It goes low only on the first cycle of the read when wait 
states are used. Address pins A1 — A4 can be decoded at the falling edge to identify the interrupt being acknowledged. The 
AVIS bit in the PMST register must be set to zero for the address to be valid when the vectors reside in on-chip memory. 

1 Valid only if the external address reflects the current instruction activity (that is, code is executing on chip with no external bus 
cycles and AVIS is on or code is executing off-chip). 
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Figure A-13. IAQ, IACK, and XF Timings Example With Two External Wait States 
oe ee ee ee th(ayIAQ ——— 


| 
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| 
«+———>- tsu(A)IAQ | 


| tw(lAQL) | 


mee Ak 
tsu(A)IACK 
TACK \ V 
| 
twIACKL) 391 
STRB ee 


| | le 3 
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turoun —1¢—> | l 


XF 


TOUT / | \ 


(*——  twrout) ———> 


Note: IAQ and IACK are not affected by wait states. 
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A.3.9 External DMA Timing 


Table A-16. Switching Characteristics Over Recommended Operating Conditions 
(H = 0.5tco)) 


te(M-HA Address three-state before HOLDA lowt 
enin-ay Enable time, HOLDA highto aderess arven | =e 
2H" 


hae 

a 

re 

ne 

= re 

: es 8 
faery _Delaytine, KER righ OG gh Ene 
3 re 

i Te 

sa ate 

re 

rs 


ty 
pyr Delay tne, read data valid ater XSTRBTow [0 
nop Read data hold time after XSTRB igh [0 
2(I-D TAQ high to data three-state PH 


t This parameter includes all memory control lines. 

+ This parameter refers to the delay between the time the condition (AQ = 0 and XR/W = 1) is satisfied and the time that the ’C5x 
data lines become valid. 

$ HOLD is not acknowledged until current external access request is complete. 

1 Values derived from characterization data and are not tested. 


- 


ori ee 


Note: X preceding a name refers to external drive of the signal. 


Table A-17. Timing Requirements Over Recommended Operating Conditions 


Li a | SE 
taping) Delay time, HOLDA ow to XBR low FT [oe 
tag-xs) Delay time, TAD low toXSTRB low FoF 

| ee 


su(XD)W Setup time, Xdata valid before XSTRB low 


Unit | 
ns 
| ns 
pons 
| ons 
RWD)W Hold time, Xdata hold after XSTRB low ee ee 
pons 
ns 
ns 
pons 


ori =~ 


iwasuy Wath KSTRB tw pulse sO 
x34) With XSTFB igh pulse i 


-- 


thoxayw Hold time, Write Xaddress hold after XSTRB low 
u 


tsucxs)RW Setup time, R/W valid before XSTRB low 
thoxayR Hold time, read Xaddress after XSTRB high 


t XBR, XR/W, and XSTRB lines should be pulled up with a 10-kQ resistor to assure that they are in an inactive high state during 
the transition period between the TMS320C5x driving them and the external circuit driving them. 

4 Values derived from characterization data and are not tested. 

Note: X preceding a name refers to external drive of the signal. 
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Figure A-14. External DMA Timing 
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A.3.10 Serial Port Receive Timing 


Table A-18. Timing Requirements Over Recommended Operating Conditions 
(H = 0.5tco)) 


Co. ae! SAL Jee 
sci Serial port clock isetime SSS 
euies) FSR setup time before GLKA faling edge —S—SC~d 17 SC—~S 
oS 
ez | 
a 


thirs “FSR hold time after CLKR falling edge 
tsu(DR) DR setup time before CLKR falling edge 
thior DR hold time after CLKR falling edge 


+ The serial port design is fully static and therefore can operate with tciscK) approaching ~. It is characterized approaching an 
input frequency of 0 Hz but tested at a much higher frequency to minimize test time. 
1 Values derived from characterization data and are not tested. 


Figure A-15. Serial Port Receive Timing 
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A.3.11 Serial Port Transmit Timing of External Clocks and External Frames 
(see Note) 


Table A—19. Switching Characteristics Over Recommended Operating Conditions 
(S = 0.5t(scK) 


[cla | |e 
tao Delay time, DX valid after CLKX rising Pe ce ee | 
tuis(D Disable time, DX after CLKX rising ee 


th(Dx) Hold time, DX valid after CLKX rising 
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Table A-20. Timing Requirements Over Recommended Operating Conditions 
(H = 0.5t(coy) 


usc Serial portlocktfaltime SCS SSCS 
[usc Serial port lockisotime SCS SSE 


tu/Fs FSX delay time after CLKX rising edge PF HBF 


thiFs FSX hold time after CLKX falling edge = 
tniFS)H FSX hold time after CLKX rising edge 


t If the FSX pulse does not meet this specification, the first bit of serial data will be driven on the DX pin until the falling edge of 
FSX. After the falling edge of FSX, data will be shifted out on the DX pin. The transmit buffer empty interrupt will be generated 
when the thFs) and th(Fs)H Specification is met. 

+ The serial port design is fully static and therefore can operate with te(sck) approaching ~. It is characterized approaching an 
input frequency of 0 Hz but tested at a much higher frequency to minimize test time. 

1 Values derived from characterization data and are not tested. 

Note: _ Internal clock with external FSX and vice versa are also allowable. However, FSX timings to CLKX are always defined 
depending on the source of FSX, and CLKX timings are always dependent upon the source of CLKX. Specifically, the 
relationship of FSX to CLKX is independent of the source of CLKX. Table A-20 shows external FSX and external CLKX 
timings; Table A-21 shows internal FSX and internal CLKX timings. 


Figure A-16. Serial Port Transmit Timing of External Clocks and External Frames 
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A.3.12 Serial Port Transmit Timing of Internal Clocks and Internal Frames 
(see Note) 


Table A—21. Switching Characteristics Over Recommended Operating Conditions 
(H = 0.5&(co), S = 0.5te(scK)) 


thio Hold time, DX valid after CLKX rising 


Note: Internal clock with external FSX and vice versa are also allowable. However, FSX timings to CLKX are always defined 
depending on the source of FSX, and CLKX timings are always dependent upon the source of CLKX. Specifically, the 
relationship of FSX to CLKX is independent of the source of CLKX. Table A-20 shows external FSX and external CLKX 
timings; Table A-21 shows internal FSX and internal CLKX timings. 


Figure A—17. Serial Port Transmit Timing of Internal Clocks and Internal Frames 
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A.3.13 Serial Port Receive Timing in TDM Mode 


Table A-22. Timing Requirements Over Recommended Operating Conditions 
(H = 0.5&(co)) 


Parameter 


h(FS TREM hold time after TCLK rising edge * 


t These parameters apply only to the first bits in the serial bit string. 

+ TFRM timing and waveforms shown in Figure A—18 are for external TFRM. TFRM can also be configured as internal. The TFRM 
internal case is illustrated in the transmit timing diagram in Figure A-19. 

§ The serial port design is fully static and therefore can operate with tc(sck) approaching ~. It is characterized approaching an 
input frequency of 0 Hz but tested at a much higher frequency to minimize test time. 

#* Values derived from characterization data and are not tested. 


Figure A—18. Serial Port Receive Timing in TDM Mode 
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A.3.14 Serial Port Transmit Timing in TDM Mode 


Table A-23. Switching Characteristics Over Recommended Operating Conditions 
(S = 0.5t(scK)) 


t These parameters apply only to the first bits in the serial bit string. 
+ TFRM timing and waveforms shown in Figure A—19 are for internal TFRM. TFRM can also be configured as external, and the 
TFRM external case is illustrated in the receive timing diagram in Figure A—18. 


Table A-—24. Timing Requirements Over Recommended Operating Conditions 
(H = 0.5t(co)) 


Parameter 


twisc Serial port clock low/high pulse duration 


tT When SCK is generated internally. 

+ The serial port design is fully static and therefore can operate with tc(Sck) approaching ~. It is characterized approaching an 
input frequency of 0 Hz but tested at a much higher frequency to minimize test time. 

# Values derived from characterization data and are not tested. 


Figure A-19. Serial Port Transmit Timing in TDM Mode 
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Mechanical Data 


A.4 Mechanical Data 


Figure A-20. 132-Pin Quad Flat Pack Plastic Package 


4,45 (0.175) 
0,254 (0.010) Nom 4,19 (0.165) 
0,635 (0.025) Nom al 0,76 (0.030) Nom > 
t 


VA 


24,18 (0.952) 
24,08 (0.948) 


27,56 (1.085) 
27,31 (1.075) 


ll t 
WW \A 
24,18 (0.952) 

24,08 (0.948) 


27,56 (1.085) 
27,31 (1.075) 


Thermal Resistance Characteristics 


ar 


ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 


Note: The contact points are within 0,15 (0.006) of being planar. 
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Appendix B 


_External Interface Timings 


This appendix discusses functional timing operations on the external memory 
interface bus. Detailed timing specifications for all 'C5x signals are contained 
in Appendix A, Electrical Specifications. 


The 'C5x memory is organized into four selectable spaces: program, local 
data, global data, and I/O space. These spaces are multiplexed through a 16- 
bit data bus and a 16-bit address bus. Each space is selected by its corre- 
sponding select signal: data select (DS), program select (PS), and I/O space 
select (IS). Global data memory accesses are distinguished by the bus re- 
quest (BR) pin. The read and write diagrams shown apply to accesses to all 
spaces. 


B-1 


B.1 Read/Write Timings 


All bus cycles comprise integral numbers of CLKOUT1 cycles. One CLKOUT1 
cycle is defined to be from one falling edge of CLKOUT1 to the next falling edge 
of CLKOUT1. For full-speed, zero-wait state operation, reads require one 
cycle and writes require two cycles. A write immediately preceded by a read 
or immediately followed by a read requires three bus cycles. 


For read cycles, STRB goes low and ADDRESS becomes valid with the falling 
edge of CLKOUT1. The RD signal then goes low with the rising edge of 
CLKOUT1 and goes high again at the next falling edge of CLKOUT1 (for zero 
wait-states read cycles). For one more wait state (multicycle) read, RD stays 
low but goes high again with the falling edge of CLKOUT1 before the next 
cycle, even if the cycles are contiguous. Read data is sampled at the rising 
edge of RD. 


The R/W signal goes high at least one half CLKOUT1 cycle before any read 
cycle; for contiguous read cycles, STRB stays low. At the end of a read cycle 
or sequence of reads, STRB goes high along with RD on the falling edge of 
CLKOUT1. 


Write cycles always have at least one inactive (pad) cycle of CLKOUT1 before 
and after the actual write operation, including contiguous writes. This allows 
a smooth transition between the write and any adjacent bus operations as well 
as other writes. For this pad cycle, STRB and WE are always high. The R/W 
signal always changes state on the rising edge of CLKOUT1 during the pad 
cycle before and after a write or sequence of writes. This prevents bus conten- 
tion when making the transition between read and write operations. Note that 
for a sequence of contiguous writes, R/W stays low. 


Timing of valid addresses for writes differs, depending on what activities occur 
before and after the write; between writes, and for the first and last write in a 
series, valid ADDRESS occurs on the rising edge of CLKOUT1. If a read im- 
mediately follows a write or series of writes, valid ADDRESS for that read cycle 
occurs one half CLKOUT 1 cycle early — that is, on the rising edge, rather than 
on the falling edge, of CLKOUT1. Note that this is an exception to the usual 
read cycle address timing. 


For the actual write operation, STRB and WE both go low on the falling edge 
of CLKOUT1 and stay low until the next falling edge of CLKOUT1 (for zero 
wait-state write cycles). For one or more wait-state (multicycle) writes, STRB 
and WE remain low but go high again on the falling edge of CLKOUT1 at the 
beginning of the pad cycle. Write data is driven approximately at the falling 
edge of STRB and WE and is held for approximately one half cycle of 
CLKOUT1 after STRB and WE go high (see Appendix A for actual timing spec- 
ifications). 


Note that transitions on the external parallel interface control outputs 
(CLKOUT1, STRB, WE, and RD) are all initiated by the same two internal 


External Interface Timing 


| Read/Write Timings 


clocks. Since these signals also use the same output buffer circuitry, they all 
switch within close tolerances of each other, as specified in Appendix A. 


Transitions on the address bus and other related outputs (IS, PS, DS, R/W, and 
BR) are initiated by the same internal signals that cause transitions on the con- 
trol outputs; however, the internal device logic used to generate these outputs 
differs somewhat from the circuitry used for the control outputs. Because of 
this, transitions on the address bus and related outputs typically occur some- 
what later than control-line transitions. 


Timings of control outputs with respect to CLKOUT1 are specified in Appendix 
A; address timing with respect to CLKOUT1 can be derived from timings pro- 
vided for address with respect to control signals and control signal timing with 
respect to CLKOUT1. Therefore, for example, the delay from CLKOUT1 falling 
to address bus valid at the begininng of a read cycle is calculated as [H — 
tsu(a)R] + Maximum positive RD to CLKOUT1 skew (refer to Appendix A for 
specific timing values). Other interface timings with respect to CLKOUT1 can 
be calculated in the same manner. 


The following timing diagrams illustrate the varieties of logical timings for both 
read and write cycles in various orders. 


Figure B—1.Memory Interface Operation for Read-Read-Write (0 Wait States) 
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Figure B—2. Memory Interface Operation for Write-Write-Read (0 Wait States) 
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CLKOUT1 


ADDRESS | | | | 


DATA 


Reaby | |! I | | 
| Two-cycle read l Four-cycle write | 
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Appendix C 


Instruction Cycle 


This appendix details the instruction cycle timings for the 'C5x processors. 
Instructions are classified into several categories according to their cycle ti- 


mings. 
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Instruction Cycle Summary 


C.1 Instruction Cycle Summary 


Each class of instructions is listed in a separate table showing the number of 
cycles required for a ’C5x instruction to execute in a given memory configura- 
tion singly or in repeat mode. The column headings in the table indicate the 
program source location (PR, PDA, PSA, PE), defined as follows: 


PR The instruction executes from internal program ROM. 

PDA _ The instruction executes from internal dual-access program RAM. 
PSA __ The instruction executes from internal single-access program RAM. 
PE The instruction executes from external program memory. 


If a class of instructions requires memory operand(s), row divisions in table in- 
dicate the location(s) of the operand(s), as defined below: 


DARAM The operand is in internal dual-access RAM. 
SARAM The operand is in internal single-access RAM. 


Ext The operand is in external memory. 
ROM The operand is in internal program ROM. 
MMR The operand is a memory-mapped register. 


MMPORT _ The operand is a memory-mapped io port. 


Note that the internal single-access memory on each 'C5x processor is divided 
into 2K-word blocks that are contiguous in address space: 


0800h—OFFFh 
1000h—17FFh 
1800h—1FFFh 
2000h—27FFh 


One 1K-word block 2800h—2BFFh Data address range 
One 1K-word block 0800h—OBFFh | Data address range | 


One 2K-word block 0800h—OFFFh Data address range 
One 1K-word block 1000h—13FFh Data address range 


All ’C5x processors support parallel accesses to these internal single-access 
blocks. However, one single-access block allows only one access per cycle. 
In other words, the processor can read/write on one single-access memory 
block while accessing another single-access block. 


Four 2K-word block 


Data address range 


Instruction Cycle Timings 


The number of cycles required for each instruction is given in terms of the pro- 
cessor machine cycles (CLKOUT1 period). The additional wait states for pro- 
gram/data memory and I/O accesses are defined below: 


p Program memory wait states. Represents the number of additional 
clock cycles the device waits for external program memory to respond 
to an access. 


d Data memory wait states. Represents the number of additional clock 
cycles the device waits for external data memory to respond to an ac- 
cess. 

lo I/O wait states. Represents the number of additional clock cycles the 
device waits for an external I/O to respond to an access. 

n Repetitions (where n>2 to fill the pipeline). Represents the number of 


times a repeated instruction is executed. 


The above variables can also use the subscripts src, dst, and code to indicate 
source, destination, and code, respectively. 


Note that all external reads require at least one machine cycle, while all exter- 
nal writes require at least two machine cycles. However, if an external write is 
immediately followed or preceded by an external read cycle, the external write 
requires three cycles. See Appendix B for details. If an on-chip wait-state gen- 
erator is used to add m (m>0) wait states to an external access, both the exter- 
nal reads and the external writes require m+1 cycles, assuming that the exter- 
nal READY line is pulled high. If the READY input line is used to add m addi- 
tional cycles to an external access, external reads require m+1 cycles and ex- 
ternal write accesses require m+2 cycles. Refer to software wait state genera- 
tion in Section 5.3 and to Appendix A for READY electrical specs. 


The instruction cycle timings are based on following assumptions: 


(.} Atleast the next four instructions are fetched from the same memory sec- 
tion (internal or external) that was used to fetch the current instruction (ex- 
cept in case of PC discontinuity instructions like B, CALL, etc.). 

_} Inthe single execution mode, there is no pipeline conflict between the cur- 
rent instruction and the instructions immediately preceding or following 
that instruction. The only exception is the conflict between the fetch phase 
of the pipeline and the memory read/write (if any) access of the instruction 
under consideration. See Chapter 3 for pipeline operation. 

[.] Inthe repeat execution mode, all conflicts caused by the pipelined execu- 
tion of that instruction are considered. 


Instruction Cycle Summary 
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Class | 


1-word, 1-cycle, no memory operands 


ABS, ADCB, ADD, ADDB, ADRK, ANDB, APAC, BSAR, CLRC, SETC, CMPL, 
CMPR, CRGT, CRLT, EXAR, IDLE, IDLE2, LACB, LACL #k, MAR, MPY #k, 
NEG, NOP, NORM, ORB, PAC, POP, PUSH, RPT #k, ROL, ROLB, ROR, 
RORB, SACB, SATH, SATL, SBB, SBBB, SBRK, SFL, SFLB, SFR, SFRB, 
SPAC, SPM, SUB #k, XC, XORB, ZAP, ZPR 


A 
a a 
OO 


t ADD, ADRK, LACL, MPY, SBRK, SPM, SUB, XC, and RPT are nonrepeatable instructions. 


Class IIA 


1-word, 1-cycle, memory read operand 


ADD, ADDC, ADDS, ADDT, AND, BIT, BITT, CPL, LACC, LACL, LACT, LPH, 
LT, LTA, LTP, LTS, MPY, MPYA, MPYS, MPYU, OR, PSHD, RPT, SQRA, 
SQRS, SUB, SUBB, SUBC, SUBS, SUBT, XOR, ZALR 


| Cycle Timings for a Single Instruction | 


ea Fe TS Ee 
ac ae (a TC 
i a Pe 
[Operandet ed [ed [ed [ee 
Cycle Timings for a Repeat (RPT) Execution 
PR POA | PSA PE 
a 
— 


Operand Ext 


t If the operand and the code are in the same SARAM block. 
+ RPT is a nonrepeatable instruction. 


Instruction Cycle Timings 


Class IIB 


1-word, 1-cycle, memory-mapped register read 


LAMM 


Cycle Timings for a Single Instruction 
Opera Te 


Cycle Timings for a Repeat (RPT) Execution 


OperngwwRT [APT 
Operand WMPORT 


Tt Add one more cycle for peripheral memory-mapped access. 
+ Add nmore cycles for peripheral memory-mapped access. 


Class Ill 


2-word, 2-cycle, long-immediate operand, no memory access 


ADD, AND, LACC, LAR, MPY, OR, SUB, XOR, RPT, RPTB, RPTZ 


Cycle Timings for a Single Instruction | 
a i aa 5 enn 
Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


Class IVA 
1-word, 1-cycle, memory write operand 


SACH, SACL, SAR, SPH, SPL, SST #0, SST #1, POPD 


Poa Psa Pe 
DO 


ot 


Topeanden sdf ee fee fe | 
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ee POA PSA PE 
[OperanapaRAM fn fe fa | 


| Operand SARAM 


Operand Ext 2n+nd 2n+nd 2n+nd 2n+2+nd+p 


t If the operand and the code are in the same SARAM block. 
Class IVB 

1-word, 1-cycle, memory-mapped register write 
SAMM 


[sree Tings Tora Single nstrucion =| 
Operanawmar [4 [aft |e 


Cycle Timings for a Repeat (RPT) Execution 


[Crean fa taf 


t Add one more cycle if source is a peripheral memory-mapped register. | 
+ Add nmore cycles if source is a peripheral memory-mapped register. 


Class V 
1-word, 1-cycle, read and write memory 


APL, OPL, XPL, DMOV, LTD 


Cycle Timings for a Single Instruction 


a 
ne i 


1 
1 
Bt 
Operand Ext 242d 5+2d+p 


Cycie Timings for a Repeat (RPT) Execution 


Operand DARAM 


Operand SARAM 


Operand Ext 


t If the operand and the code are in the same SARAM block. 


Instruction Cycle Timings 


Class VI 


2-word, 2-cycle, memory read and write 


APL, OPL, XPL 


Cycle Timings for a Single Instruction 


Operand SARAM 2n-1 


Operand Ext 4n-1+2nd | 4n—-1+2nd | 4n—1+2nd 


T If the operand and the code reside in same SARAM block. 
Class Vila 


2-word, 2-cycle, memory read operand 


CPL #lk,dma 


0 0 


Operand SARAM 


Operand DARAM 


SPR PDR, 
Operand SARAM 


Operand Ext 


t If the operand and the code are in the same SARAM block. 
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Class Vilb 
2-word, 2-cycle, memory write operand 


SPLK #Ik 
Cycle Timings for a Single Instruction 
Operand DARAM 


CPR [ POA | PSA 
ae ee 
3t 


Operand Ext 5+d+2p 
Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


t If the operand and the code are in the same SARAM block. 
Class VIII 


2-word, 4-cycle, PC discontinuity, no delay slot 


B, BANZ, BCND, CALL, CC 


Cycle Timings for a Single Instruction 
Cycle Timings for a Repeat (RPT) Execution 
Not Repeatable 


t Applicable only to conditional instructions. 
+ The 'C5x performs speculative fetching by reading two additional instruction words. If PC 
discontinuity is taken, these two instruction words are discarded. 


Class IX 
2-word, 2-cycle, PC discontinuity, 2 delayed slots 


BD, BANZD, BCNDD, CALLD, CCD 


Cycle Timings for a Single Instruction 


> 
+ 

ES 
A?) 

++ 


Cycle Timings for a Repeat (RPT) Execution | 
Not Repeatable | 


t Applicable only to conditional instructions. 
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Instruction Cycle Summary 


Class X 


1-word, 4-cycle, PC discontinuity, no delayed slots 


BACC, CALA, RETC, RET, NMI, INTR, RETE, RETI, TRAP 


Cycle Timings for a Single Instruction 


Condition Falset 
Cycle Timings for a Repeat (RPT) Execution | 
| Not Repeatable 


t The’C5x performs speculative fetching by reading two additional instruction words. If PC discon- 
tinuity is taken, these two instruction words are discarded. 
+ Applicable only to conditional instructions. 


Class XI 


1-word, 2-cycle, PC discontinuity, 2 delayed slots 


BACCD, CALAD, RETCD, RETD, TRAPD 


Cycle Timings for a Single Instruction 


t Only applicable to conditional instructions. 


NM 
+ 
To 


Class XIl 


2-word, 3-cycle, block data transfer, data to data space 


BLDD #lk,dma; BLDD dma,#lk 


Cycle Timings for a Single Instruction 


Source DARAM 3+2p 
Destination DARAM 
Source SARAM 
Destination DARAM | | 
Source Ext 3+dgr¢ 3+dsr¢ 3+dsrc 3+dec+2p 
Destination DARAM : 
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- 
—~+ 


Source DARAM 
Source SARAM 
Destination SARAM 
Source Ext 
Destination SARAM 
Source DARAM 4+dogst 
Destination Ext 


4+deo 


Source SARAM 4+doyst 4+dycr 4+dyst 

| Destination Ext 
Source Ext 44+dsro+Agst 4+dsro+ gst 44+dsr¢+Agst 6+dgrot+Ags¢+2p 
Destination Ext | 


Cycle Timings for a Repeat (RPT) Execution 


Go 

a 
g 
¢) 


é) 
4. 
ee 
i?) 
BS 
—+ 


Source DARAM n+2 n+2 

Destination DARAM 

Source SARAM n+2 n+2 
Desinaton Oana | 

Destination DARAM 

Source DARAM n+2 n+2 


Destination SARAM 


Source SARAM 
Destination SARAM 


Source Ext n+2ndsr¢ 
Destination SARAM 


n+2 
2nt 


N+2nderco n+2nderc¢ N+2+Ndg¢+2p 
Source DARAM 2n+2+Ndgsz 2nt+2+ndyst 2N+2+Ndgot 2N+2+Ndgs¢ +2p 
jailed 
Source SARAM 2n+2+Ndgcz 2n+2+ndgcz 2N+2+Ndygcz 2n+2+ndgs¢+2p 
Destination Ext 


Source Ext 4n+ndgo+ndgst* 4n+ndecot+ndgst A4ntnderotNdgst | 4N+2+ndootndgs¢+2p 
Destination Ext ! 


t If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 
§ if both operands and the code are in the same SARAM block. 

Class XIll 


1-word, 2-cycle, block data transfer, data to data space 


n+2 
2nt 


BLDD BMAR,dma; BLDD dma,BMAR 


Cycle Timings for a Single instruction 


Source DARAM : 
Destination DARAM 
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Cycle Timings for a Single Instruction (Continued) 


po PR PA PSA PE 

Source DARAM -2+p 

Source SARAM 2+p 

Source Ext 2+Agro 2+derc 2+derc¢ 2+dec+p 
Desinatonoaran | | |e 

Source DARAM 2 2+p 

Source SARAM | 2 2+p 
destraionsaran | | 

Source Ext 2+dsr¢ 2+dsrc 2+dsrc 2+dgro+P 

Source DARAM 3+dyst 3+dyst 3+dg¢ 5+dgsttp 
[desinatonet fo Le fe 

Source SARAM 3+dyst 3+dgs¢ 3+dys¢ 5+dgs¢t+P 
[destnaionee [fee ee 


Source Ext | 34¢d5ro+ dust 
Destination Ext 


34+dsr¢+d dst 3+dgr¢+dast S+dgro+dgs¢+P 


Cycle Timings for a Repeat (RPT) Instruction 


Source DARAM n+1 n+1 n n+1+p 
Destination DARAM 


+1 
Source SARAM n+1 n+1 +1 n+1+p 
+1 


n+1+Nnder¢ 


Destination DARAM _ | | 
n+14+Nndgro n+14+ndgp+p 


Source Ext 
Destination DARAM i 


Source DARAM n+1 n+1 n+1 n+1+p 
Destination SARAM n+3t 

Source SARAM | n+ n+1 n n+1+p 
Destination SARAM | 2n-1# 2n-1# 2n-1* 2n—1+p? 


n+38 
2n+18 


Source Ext n+1+ndgot N+14Ndgro n+14Ndgyc n+1+ndg-o+p 
_Destnaion sana | | ME | Nines | 
Destination Ext | | | 


. 
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Cycle Timings for a Repeat (RPT) Instruction (Concluded) a | — 
rrrCOS PRTC PDA PSSA [re 


Source SARAM | 2n+1+ndyer 2nt+1+ndyee 2n+1+ndysy } 2n+1+ndye+p : 
a 
Destination Ext 


t If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 
§ If both operands and the code are in the same SARAM block. 

Class XIV 


2-word, 3-cycle, block data transfer, program to data space 


BLPD #lk,dma 


Cycle Timings for a Single Instruction | | 


PO ot—“;‘ié‘~i PRC CE PACT PSA PE 
Source DARAM/ROM | | 3+2Pcode | 
Source SARA | | | | oS 
[Destpatrbaram | 
Destination DARAM | 
Source DARAM/ROM | | 1 3 | 1 3+2pcode i tttst~™S 
Detnanseraw || 
Source SARAM | - 3 — 3+2Pcode | 
[Dentnatonsaram [| 
Destination SARAM 44+Dsrcot | 


Source DARAM/ROM | 4+dyor 4+dost 4+dost 6+dosrt2Pcode 
Destination Ext 4+dys¢ 4+dyst | 44dys¢ 6+dasit2Pcode 


Destination Ext 
Cycle Timings for a Repeat (RPT) Execution 
PPR CT POAT PSA 

Source DARAM/ROM | n+2 n+2 n+2 | +2+2Doode 

Destination DARAM | inal 

Source SARAM n+2 n+2 n+2 N+2+2Dcode | 
[Deaesonparan | 
Source Ext | n+2+NDsro N+2+NPsro ss. NF 2+NDeyre | n+2+npsro+2Dcode | 
mami | | | | 


C-12 Instruction Cycle Timings 


Instruction Cycle Summary 


Cycle Timings for a Repeat (RPT) Execution (Continued) 


n+2 | 
Destination SARAM 2nt 


PRO 

n+2 

2n* 
Source Ext N+2+NDerct N+2+NPerco n+2+NParo _ “N+24+NDsrot+2Pcode 
Deetrton sama | oi ial 
Source DARAM/ROM | 2n+2+ndg.z 2n+24+ndysy 2n+2+Ndysz 2N+2+Ndgsit2Pcode 
oetraint ff fe [ee 
Source SARAM 2n+2+ndgst 2n+2+ndgst 2n+2+ndgs¢ 2n+2+Nndgsi+2Pcode 
otras fn fn fe ee 
Source Ext 4n4NDocotNdgset | 4n+nPorotNdgs¢ | 4N+NPsrctNdgst | 4N+2+NPsro+Ndysit+2Pcode 


t If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 
§ If both operands and the code are in the same SARAM block. 


Class XV 


Source DARAM/ROM 
Destination SARAM 
Source SARAM 


n+4t 


N+2+2prode 
2N+2Pcode* 


1-word, 2-cycle, block data transfer, program to data space 


BLPD BMAR,dma 


7 Cycle Timings for a Single Instruction 


cae 7 ee 2 
Source DARAM/ROM 2+Pcode 
Source SARAM | 2+Pcode 
‘onairaidaran ||| 
Destination DARAM 


Source DARAM/ROM | | 2+Pcode 
Destination SARAM 


2 
3t 
| Source SARAM : | 2 2+Pcode | 
Destination SARAM | 3t 
Source Ext 2+Dsrc 2+Dsrc | 2+Dsrc 2+DPsrct2Pcode 
Destination SARAM | 3+DPsro! 
Source DARAM/ROM | 34-dust 3+dast 34+dust 5+dust+Pcode | 
Destination Ext 
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| Cycle Timings for a Single Instruction (Continued) 


Destination Ext 
3+DPsro+Agst 5+Psrco+ddst+Pcode 


Source Ext 
Cycle Timings for a Repeat (RPT) Execution 


Destination Ext 


3+DPsro+dgst 


Source DARAM/ROM n+1 
Destination DARAM 
Source SARAM 
Destination DARAM 
Source Ext 
Destination DARAM 


n+1 +Doode 


n+1 
n+1 


n 


a) 
+ 
—_ 


N+14+NPsre N+1+NDpere 


N+14+NPgro N+1+NPsrc+Pcode 


n 
n 


+1 
+1 
+1 
+1 


Source DARAM/ROM n+1 N+1+DPcode 
Source SARAM n+1 n N+1+DPcode 
Destination SARAM 2n—14 2n—1* 2n—1# 2n—-1+Dcoge* 
n+3t 
2n+18 


Source Ext 
Destination SARAM 
Source DARAM/ROM 
Destination Ext 
Source SARAM 
Destination Ext 
Source Ext 
Destination Ext 


N+1+NPoro 


n+1 +NPsro n+1 +NDsrco n+1 +NDsrotPcode 
N+3+NPgrot 
2n+14+ndgst 2n+1+Ndgst+Pcode 


4n—14nPsct | 4n-14NPgct | 4n+1+NPsrc+Ndgst+Pcode 


Ndgst Ndgst 


t If the destination operand and the code are in the same SARAM biock. 
+ If both the source and the destination operands are in the same SARAM block. 
$ lf both operands and the code are in the same SARAM block. 


4n—1 + n Psrc + 
Ndast 
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7 Instruction Cycle Summary 


Class XVI 


1-word, 2-cycle, block data transfer, data to program space 


BLDP dma 


Cycle Timings for a Single Instruction 
Source DARAM 
Destination DARAM | 


Hin OE (i 
Soule? alah 
> 
Source E Ext 2+dosrc¢ 2+dosre 2+dsro 3+dsrc+Pcode 
cl ial 
Source [ DARAM 
ae Ee 
| Source SARAM | | 
Destination SARAM Fo Yaw 
aes Ext | 24dsro 2+derc 2+ dre 3+dsre+Pcode 
ae 
| Source DARAM 3+ Past 3+Past 3+Past 4+Pdst+Pcode 
a 
Destination Ext 4+Past" 
Destination Ext 


Cycle Timings for a Repeat (RPT) Execution 


Source DARAM n+1 N+1+Dcode 
Destination DARAM | 


Source SARAM n+1- | n+1+Deode 
Destination DARAM 


co 
Source ma 


n+1 +Ndgr¢ n+1+nder¢ n+14+ndgr- n+2+ndsro+Pcode 
Source DARAM n+1 N+1+Dcode 


C-1 


or 


Instruction Cycle Summary 


Cycle Timings for a Repeat (RPT) Execution (Continued) 


Source SARAM n+1 n+1 N+1+Pcode 
Source Ext n+t+ndge =| nt 1 +ndgye N+14Ndsr¢ n+2+ndsro+Pcode 
ee a a 
Source DARAM 2n+1+Npgst 2N+1+NPgst 2nN+1+NPgst 2N+2+NPgstt+Pcode 
joeamaonse fn pe 
Source SARAM 2n+1+NPgst 2n+14+NDPgst 2n+1 +NPgst 2N+2+NPgsttPcode 
joeamionse nL | amt [ee 
Source Ext 4n—1+Ndere+MPase | 4N—1+Nds-+NPgse | 4N—14Nderet+NPast | 4N+1+Ndere+NPgsrt 
ee ied ie ee 


t If the destination operand and the code are in the same SARAM block. 

+ If both the source and the destination operands are in the same SARAM block. 
§ If both operands and the code are in the same SARAM block. 

1 If the source operand and the code are in the same SARAM block. 


Class XVII 


1-word, 3-cycle, table read 


TBLR 


Cycle Timings for a Single Instruction | 


aren es 7 

Source DARAM/ROM 3+Pcode 
Source SARAM 3 3+Pcode 
ominwononrwe ||P 

Destination DARAM 

Source DARAM/ROM 3 3+Pcode 
ominaensaran || fe 

Source SARAM 3 3+Pcode 
ee a ee 

Destination SARAM 4+Dsrc! 

Source DARAM/ROM 4+dust 4+dyst 4+dyst 6+dyst+Pcode 
oaiminst | f 
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Instruction Cycle Summary 


Cycle Timings for a Single Instruction (Continued) 


Destination Ext 


Source Ext 4+Dsrot+dgst 4+DPsro+dast 4+Dsro+dast 6+Psro+Agst+Pcode 
Destination Ext 


Cycle Timings for a Repeat (RPT) Execution 


Source DARAM/ROM n+2 n+2 n+2 N+2+Pcode 
Destination DARAM 


Source SARAM n+2 n+2 n+2 N+2+Dcode 

Source Ext N+2+NDPorc N+2+NPerc N+24NDero N+2+NPsrc+Pcode 
Source DARAM/ROM n+2 n+2 n+2 N+2+Pcode 
Destination SARAM La n+4t — | 
source SARAM n+2 n+2 n+2 N+2+DPcode 
Destination SARAM 2nt 2nt 2nt 2nt 


Source Ext 
Destination SARAM 
Source DARAM/ROM 
Destination Ext 
Source SARAM 
Destination Ext 
Source Ext 
Destination Ext 


t If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 
§ if both operands and the code are in the same SARAM block. 
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Class XVIII 
1-word, 3-cycle, table write 
TBLW 


Cycle Timings for a Single Instruction 


| Source DARAM 
Destination DARAM 
Source SARAM 
Destination DARAM 
Source Ext | 34dgrc 
Destination DARAM 

| Source DARAM 
Destination SARAM 
Source SARAM | 
Destination SARAM 
Source Ext 
Destination SARAM 
Source DARAM 
Destination Ext 

| Source SARAM 
Destination Ext 


Source Ext 
Destination Ext 


3+Pcode 
3+Pcode 


3 
4t 
3 
4t 


3+dor¢ 


3+Dcode 


34+derco 


3+ Poode | 


34+dgr¢ 3+sro+Pcode 


3+der¢ 


4+der¢ 


4+Dast 4+Past 5+Past+Pcode 


4+dsrco+Past 5+sro+Pastt Pode 


Cycle Timings for a Repeat (RPT) Execution __ —_ 


4+Past | 4+Past 


4+dsro+Pust 


| Source DARAM __—i|_sin+2 n+2 N+2+Dcode 

Destination DARAM | 
| Source SARAM n+2 n+2 | N+2+Dcode | 7 
| Destination DARAM 


Source Ext 
Destination DARAM 


N+2+Ndere 


n+2 | 

n+2 

n+2 N+2+Dcode | 
n+3t 


n+2 n+2 N+2+Dcode 
2nt 2n* | 2nt 
2n+1§ 


n+2+ndoro N+2+Nndero n+2+ Ndgro+Pcode | 
n+3+ndgot 
2n+2+nDpust | 2n+2+npasy 2n+2+nPast 2n+3+NPust+Pcode 


Source DARAM n+2 n+2 
Destination SARAM 


Source SARAM 
Destination SARAM 


n+2 
2nt 


Source Ext 
Destination SARAM 


Source DARAM 
Destination Ext 


N+2+Ndgro 


=y 
2 
on | 
S 
=. 
fs) 
3 
o 
8 
® 
= 
3 
> 
Q 
a 


C-18 


Cycle Timings for a Repeat (RPT) Execution (Continued) 


Source SARAM 2N+2+NPgst 2N+2+NPgst 2N+2+NPast 2N+3+NPgst+Poode 
Destination Ext 


Source Ext 4n+ndgro+NPast 4n+ndgrot+NPagst 4n+ndoro+NPgst 4n+14+Ndoro+NPgstt 
Destination Ext Pcode 


t If the destination operand and the code are in the same SARAM block. 
+ If both the source and the destination operands are in the same SARAM block. 
§ If both operands and the code are in the same SARAM block. 


Class XIX 
2-word, 3-cycle, multiply accumulate 


MAC #lk,dma 


Cycle Timings for a Single Instruction 


Operand1 DARAM/ROM 3+2Pcode 
Operand2 DARAM 


Operand1 SARAM 3+2Pcode 
Operand2 DARAM 


Operand2 DARAM 

Operand1 DARAM/ROM 3+2Pcode 
Operand1 SARAM 3 3 3 3+2Pcode 
Operand2 SARAM 4t 4+2Dcodet 
Operand1 Ext 3+Pop1 3+Pop1 3+Pop1 3+Pop1+2Pcode 
| Operand1 DARAM/ROM | 3+dop2 3+dop2 3+dop2 3+d0p2+2Dcode 
Operand1 SARAM 3+dop2 3+dop2 3+dop2 3+dop2+2Pcode 
Operand2 Ext 


Cycle Timings for a Repeat (RPT) Execution 


=a 
+ 
Nh 
= 
+ 
NO 


Operand1 DARAM/ROM | n+2 
Operand2 DARAM 
Operand1 SARAM n+2 
Operand2 DARAM 


n+2+2 Doode 


S 
+ 
N 


Instruction Cycle Summary 


| _——- Gyele Timings for a Repeat (RPT) Execution (Continued) 


Operand1 Ext N+2+NPop1 N+2+NPop1 N+2+NPop1 N+2+NPop1+2Pcode 


Operand2 DARAM : 
Lat DE ea 


Operand1 DARAM/ROM | n+2 +2 n+2 
Operand2 SARAM 
Operand1 SARAM | n+2 | +2 n+2 N+2+2Drode 
Operand2 SARAM 2n+2t 2n+a2t 2n+2t 2n+2t 
p2 p2 p2 


Operand2 SARAM | 
Operand2 Ext | | 
Operand1 SARAM | N+2+Ndop2 N+2+Ndop2 n+2+ndop2 n+2+ndop2+2Pcode 
oma [|e 


Operand1 Ext — 2N+2+NPop1tN | 2N+2Z+NPop7+N | 2N+2+NPop7+ 2N+2+NPop1+Ndgpat 
Operand2 Ext do | do ndp 2Pcode | 


T If both operands are in the same SARAM block. 


Class XX 


1-word, 2-cycle, multiply-accumulate 


MADS dma 


| | Cycle Timings for a Single Instruction 
EDA a <a 


ee 

Operand! DARAM/ROM | 2+Pcode 

Operand1 SARAM | 3 2+Pcode 
ee a 

Operand1 Ext — 2+Pop1 2+Popt 2+Pop1 
ee aca eee a 

Operand1 DARAM/ROM 2+Pcode 
eld a A a 

Operand 1 SARAM 2 2 2 2+Pcode 

Operand2 SARAM 

Operand2 Ext 
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Cycle Timings fora Single Instruction (Continued) 


Operand2 Ext 


Operand2 Ext 


Cycle Timings for a Repeat (RPT) Execution 


n+1 
Operand1 SARAM n+1 
Operand2 DARAM 

n+1 

n+1 


Operand2 SARAM | 7 


| Operand! DARAM/ROM | n+1+ndop2 ] n+14Nndop2 
Operand2 Ext | 


iw " ~ " =_— 
n+1+ndop2 


Operand1 SARAM | n+1+ndop2 n+1+ndop2 Ss | n+1+Ndop2+Pcode 

| Operand2 Ext | 
Operand1 Ext | 2n+1+NnPop1+ 2n+1+NPop1+ | 2n+1+NPop1+ 2N+1+NPop1+Ndgpat 
Operand2 Ext Ndop2 Ndop2 Ndop2 Pcode 


t If both operands are in the same SARAM block. 


Class XX 


a 
Operand1 DARAM/ROM | n+ n+1 n+1 N+1+Dcode 
cpomezonran ff 
| 
Operand2 DARAM | 
Operand1 DARAM/ROM +1 | n+4 n+1 N+1+Pcode 
er a A A 
Operand1 SARAM +1 n+1 Y n+ N+14Dcode 


2-word, 3-cycle, multiply accumulate with data move 


MACD #lk,dma 


| Cycle Timings for a Single Instruction 


a ee fc a 
Operand1 SARAM 3 3+2Pcode 
Operand2 DARAM | 


Operand1 DARAM/ROM_ | 3+2Pcode 
Operand2 DARAM 
C-21 


Instruction Cycle Summary 


Cycle Timings for a Single Instruction (Continued) 


Operand2 DARAM 


3+2Pcode 
4+2Dcode* 


Operand1 DARAM/ROM 
Operand2 SARAM 


ii 


Operand1 SARAM 
Operand2 SARAM 


/ 
oOo ++ 


| Operand1 Ext — 3+Pop1 | 
Operand2 SARAM 
Operand1 DARAM/ROM | 3+dop2 
Operand2 Ext$ 
Operand1 SARAM 
Operand2 Ext 

| Operand1 Ext 


Operand2 Ext* 


3+Pop1 | 3+ Pop? 3+Pop1 +2Dcode 


3+dop2 3+ dop2t 2Pcode 


4+Pop1t+dop2 4+Pop1t+dop2 4+Pop1 +dop2 4+Pop1 +dopa+2 Peode | 


Cycle Timings for a Repeat (RPT) Execution 


Operand1 DARAM/ROM | n+2 


n+2 | | n+2 | N+2+2Pcode 
Operana2 DARAM a 
Operand1 SARAM n+2 n+2 n+2 | N+2+2Poode 
Operand2 DARAM | 
Operand1 Ext n+2+nPop} 
2n 2n+2Dceode 
2n an 


| Operand2 DARAM 
Operand1 DARAM/ROM 

2n+2Pcode 

n 


Operand2 SARAM 
Operand1 SARAM 
2N+NPop1 2N+NPop1 2N+NPop1+2Pcode 
2n+2+NPop1t 


N+2+NPop1 N+2+NPop1+2Pcode 


NO 
a) 


Operand2 SARAM 3nt 3nt | 2ns2t 


Operand1 Ext 
Operand2 SARAM 
Operand1 DARAM/ROM 
Operand2 Ext" 


2N+NPop1 


N+2+Ndop2 


GO 
+H 
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Instruction Cycle Summary 


Cycle Timings for a Repeat (RPT) Execution (Continued) 


Operand2 Ext? 


Operand2 Ext‘ Ndop2 Ndop2 Ndop2 2Pcode 


t If operand2 and code are in the same SARAM block. 

+ If both operands are in the same SARAM block. 

§ If both operands and code are in the same SARAM block. 

¥ Data move operation is not performed when operand 2 is in external data memory. 


Class XXIl 


1-word, 2-cycle, multiply accumulate with data move 
MADD dma 


Cycle Timings for a Single Instruction 


Operand1 DARAM/ROM 
Operand2 DARAM 


Operand1 SARAM 2+Pcode 
Operand2 DARAM 


Operand1 SARAM 
Operand2 SARAM 


2+Pcode 
3+Pcode* 


Operand2 SARAM 
Operand1 DARAM/ROM | 2+dop2 2+don2 2+dop2 2+dop2+Pcode 
eel ie ene ee 
Operand1 SARAM 2+don2 2+dop2 2+dop2 2+dop2+Pcode 
ee ee i a 
Operand2 Ext! 
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Instruction Cycle Summary 


Cycie Timings for a Repeat (RPT) Execution 


Operand1 DARAM/ROM N+1+Pcode 


Operand2 DARAM 


Operand1 SARAM n+1 n+1 1 N+1+DPcode a 
Operand2 DARAM 

2n-1 2n-1 n—1 2n-1+Pcode 
Operand2 SARAM 2n+it 
Operand1 SARAM 2n-1 2n-1 2n—1 2n—1+Pcode 


Operand2 SARAM 3n—1# 3n—-1# 2n+it 3n—1* 
3n—1# 
3n+18 


2n—1+NPop1 2n—-1+NPop1 2n-1+NPop1 2n-1+NPop1+Pcode | 


N+1+Ndop2 n+1+Nndop2 n+1 +Ndop2+Pcode 


n+ 
n+ 
2 


Operand1 DARAM/ROM 


t If operand 2 and code reside in same SARAM block. 
+ If both operands reside in same SARAM block. 


2n+1+NPop1+ 2N+1+NPop1+Ndopot+ 
Ndop2 Pcode | 
$ If both operands and code reside in same SARAM block. 


4 Data move operation is not performed when operand2 is in external data memory. 
Class XXill 
2-word, 2-cycle, memory map register load 


LMMR dma,#lk 


Cycle Timings for a Single Instruction 


re | ae 0 ee eee 2 eee 
Source DARAM | 2+2Pcode 

ood fT Le 
Source SARAM 2 2+2Dcode 

eee a 
Destination MMR? 
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Instruction Cycle Summary 


Cycle Timings for a Single Instruction (Continued) 


Destination MMPORT 


Source SARAM 3+iOgst 3+iOgst 3+iOgst 5+2PcodetiOugst 
Destination MMPORT 4t 


ee ee 

Source DARAM 2N+2Dcode 

Source SARAM 2n 2N+2Dcode 
cesratenmme ff fame fe 

Source Ext 2n+ndere 2n+Nderc 2nt+Nderc | 2N+14+Nderc+2Pcode 
eee i 

Source DARAM 3N+Ni0gst 3N+Nidgsz 3N+Nidgs¢ 3N+3+Ni0gst+2Pcode 

Source SARAM 3N+Nidgst 3n+nidgsz 3N+Nidgst 3N+3+Ni0gs++2Pcode 
cetrtenmmront fn | | otmmyt | 

Source Ext 4n—1+ndect | 4N—14+Ndgct+ | 4N—14Ndgct | 4N+2+Ndgpot+ NiOgst + 


t If the source operand and the code are in the same SARAM block. 
+ Add one more cycle for peripheral memory-mapped register access. 
§ Add n more cycles for peripheral memory-mapped register access. 


Class XXIV 
2-word, 2-cycle, memory map register store 
SMMR dma,#lk 


as 5 | RE FE Ee [ee 
Destination DARAM | 24+2Peode 
jsoocomie | Te 
Destination SARAM 2 2+2Pcode 
sooo | | 
Destination Ext 3+dys¢ 3+dyst 3+dys¢ 5+dgstt2Pcode 
sooo fe 
Destination DARAM ss | 3+i0g,¢ 3+i0src 3+i0src 4+i0gre+2Dcode 
sorcomort [on ff 
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Instruction Cycle Summary 


Cycle Timings for a Single Instruction (Continued) 


Destination SARAM 3+iOsr¢ 3+i0gr¢ 3+10sr¢ 3+i0sre+2Pcode 
Source MMPORT 
Destination Ext 4+10sro+dysz 4+l0gro+dyst 4+105r¢+dgs¢ 6+10sr¢+dgsit2Pcode 


Cycle Timings for a Repeat (RPT) Execution 


Destination DARAM 
Source MMRS§ 


Destination SARAM 2n 2N+2Pcode 
a ee 
Destination Ext 3N+Ndgsz 3N+Ndygez 3N+Ndgez 3N+3+Ndyst+2Pcode 
coome | fe eee 
Destination DARAM 2N+NiOgro 2N+NiOcrco 2N+NiOsro 2N+1+NiOgre+2Pcode 
[sorcewort [on fm || eee 
Destination SARAM 2N+NiOgr¢ 2N+NiOgre 2N+NiOsro 2n+1+NiOsre+2Pcode 
[sowsemrort [on | ances | 
Destination Ext 5n—2+ndgs¢t 5n—2+ndgst+ 5n—2+ndgst+ 5N+1+Ndgs¢+NiOsro+ 


t If the destination operand and the code are in the same SARAM block. 
+ Add one more cycle for peripheral memory-mapped register. 
$ Add n more cycles for peripheral memory-mapped register access. 


Class XXV 


2-word, 3-cycle, output port 
OUT dma, port 


Cycle Timings for a Single Instruction 


Source SARAM 3+iOgst 3+iOgs¢ 3+i0gs¢ 5+iOgst+2Pcode 


6+ dsrotlOdst+2Poode 
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| Cycle Timings for a Repeat (RPT) Execution 


8ntnidgs: 8n+nidges 8n+nidgs: 3n+3+nidgs+2Dcode 


| Source SARAM 3N+NiOgst 3N+NiOgst 3N+NiOgst 3N+3+NiOgs:+2Pcode 
3N+1+Nidgs;t 
Source Ext 5n—-2+Ndgro+ 5n—-2+Ndercot+ 5n—-2+nderot+ 5n+1+Ndoo+NiOgst+ 
MOgst NlOgst MNOgst 2Pcode 


t If the source operand and the code are in the same SARAM block. 


Class XXVI 
2-word, 2-cycle, input port 


IN dma,port 


Cycle Timings for a Single instruction 


2+i0sr¢ 


- Cycle Timings for a Repeat (RPT) Execution 


| Destination SARAM | 2n+nidgrc 2n+Nidsre | 2N+Nidgre 2n+1+Nidsre+2Pcode 
| 2n+2+nidg;c! 
Destination Ext 4n—1+ndgyt | 4n—1+ndgert 4n—14ndgei+ 4n+2+ndyg¢tNiOsro+ 
ee MNOsre | MOsre MOsre 2Pcode 


t If the destination operand and the code are in the same SARAM block. 


Class XXVII 
1-word, 2-cycle, pipeline-protected, memory read 
LDP dma; LST #0,dma; LST #1,dma, LAR ARn,dma 


| Cycle Timings for a Single Instruction 
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Cycle Timings for a Repeat (RPT) Execution 
Source DARAM 


Source SARAM 2n 2N+Pcode 
2n+it 


t If the source operand and the code are in the same SARAM block. 


Class XXVIII 


1-word, 2-cycle, pipeline-protected, nonrepeatable 


LDP #k; LAR ARN,#k 


Cycle Timings for a Single Instruction 


Source Dana [2 +f? 4.2 | howe 


2 2+Dcode 


Cycle Timings for a Repeat (RPT) Execution 


Not Repeatable 


t If the source operand and the code are in the same SARAM block. 
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Appendix D 


This appendix contains information that is necessary to upgrade a ’C25 sys- 
tem into a ’C5x system. The information consists of a detailed list of the pro- 
gramming differences and hardware and timing differences between the two 
generations of TMS320 DSPs. Note that the ’C50, C51, and ’C53 have the 
same features with the exception of memory map; so within this appendix, any 
reference to 'C5x applies to ’C50, ’C51, and ’C53, unless otherwise stated. 
This appendix contains the following: 


Topic Page 


Package and Pin Layout 


D.1 Package and Pin Layout 


The ’C25 is available in both a 68-pin CPGA and a 68-pin PLCC as shown in 
Figure D—1 and Figure D-2, respectively. The ’C5x devices are packaged in 
a 132-pin Quad Flat Pack package (QFP), as shown in Appendix A. 


Figure D—1.’C25 68-Pin Ceramic Pin Grid Array 


Top View 
28,448 (1.120) 
27,432 (1.080) 


17,02 (0.670) 
Nom 


28,448 (1.120) 
| 27,432 (1.080) 


Thermal Resistance Characteristics 


R Junction-to-free-air 
@J 


thermal resistance 


Junction -to-case 
thermal resistance 


4,953 (0.195) 


a 1,397 (0.055) — 


3,302 (0.130) 
2,794 (0.110) eiede 0,508 (0.020) || A Te oon pees Ce) ip 


0,406 (0.016) 1,473(0.058) 058) ° 


Bottom View 2,54 
, (0.100) 
T.P. 


nN 
gi 
rs 
So 
at 
iS 
=| 
U 
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ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES. 


D-2 System Migration 


Package and Pin Layout 


Figure D-2.'C25 68-Pin Plastic Leaded Chip Carrier 


1,35 (0.053) ° 
1,19 (0.047) * * 
0,25 (0.010) R Max 
2,79 (0.110) in 3 places 
AOU UEP) 24,33 (0.956) 
24,13 (0.950) 
4,50 (0.177) (see Note A) 
4,24 (0.167) 
25,27 (0.995) 
23,62 (0.930) 25,02 (0.985) 


23,11 (0.910) 
(At Seating Plane) 


1,27 (0.050) T.P. 


| (See Note B) 24,33 (0.956) 
| 24,13 (0.950) 
(see Note A) 
1,22 (0.048) 

1,07 (0.042) * “° 

| Ss 25,27 (0.995) 
0,69 (0.027) 27 (0. 

Seating Plane 25,02 (0.985) 


1,52 (0.060) Min 


0,81 (0.032) 
0,66 (0.026) 7 a 
Thermal Resistance Characteristics 
(0025 
Min 


Junction-to-free-air . 
ReJA thermal resistance CW 

| 

+ 0,51 (0.020) 

i 0,36 (0.014) 
| 


Lead Detall 
Notes: A. Centerline of center pin, each side, is within 0,10 (0.004) of package centerline as determined by this dimension. 
B. Location of each pin is within 0,127 (0.005) of true position with respect to center pin on each side. 


R Junction-to-case 
8JC thermal resistance 


ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES. 
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Package and Pin Layout 
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When a’C25 is upgraded to a ’C5x, there is minimal layout modification. The 
’C5x signals are on the same side (except the CLKR and AO pins) and in the 
same order (except the X1 and X2/CLKIN pins) as those of the 'C25. 
Figure D—3 shows the pin-to-pin relationship between the ‘C25 and the 'C5x 
devices in J-leaded chip carrier packages. Note that the two devices are not 
drawn to scale. The power (Vpp) and ground (Vss) signals are symmetrically 
positioned on the ’C5x so that, in conjunction with the OFF signal, the device 
is not damaged by inserting itin the wrong orientation. The ’C5x has more pow- 
er and ground pins to provide higher performance and more noise immunity 
than the ’C25. 


Figure D-—3.’C25-to-’C5x Pin/Signal Relationship 
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8 16 
L} 19 15 
Vsst]} 20 14D Vop 
Vssl} 21 113 1) Voo 
L} 22 112 [) TACK 
D7] 23 Theo 
DEL} 24 110 |] CLKOUT1 
DSL] 25 109 |] XF 
D4{] 28 108 |] HOLDA 
D3L} 27 107 [] TDX 
D2L] 28 106 1] DX 
D1] 29 10 |] TFSX/TFRM 
DOL] 30 104 |] FSX 
TMSL] 31 103 1} CLKMD2 
Voc 32 102 [] Vss 
VoL] 33 101 L] Vss 
TCKL] 34 100 [] TDO 
Vegi} 35 9 1] Vop 
Vso] 36 98 1] Vop 
L} 37 97 {] X1 
INTIL] 38 96 [] X2/CLKIN 
INT2L] 39 95 [] CLKIN2 
INTSL] 40 941] BR 
INT4L4 41 26 44[ 93 1] STRB 
NMIL] 42 92 1] RAW 
DRL] 43 O- TMWOKNR ODMHMROKNM YN 91] PS 
TDR 4 GPL ek ls OCC FEE EES of] 1S 
FSRALI 4 s9 1] DS 
CLKRL] 48 ss [| 
Voc} 47 87 1) Vss 
Voc} 48 86 L] Vss 
49 85 LL] 
50 a4 1] 


51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 83 
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Note: Pins without callouts are unassigned (reserved). 
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Three ’C25 signals (CLKOUT2, MSC and SYNC) are not present on the ’C5x. 
Because the ’C5x operates with a divide-by-two clock, it can be synchronized 
with reset. Therefore, there is no need for the SYNC signal. With only two 
phases, there are no external timings that tie to the CLKOUT2 of the ’C25. 


Some of the ’'C25-equivalent pins have additional capabilities on the ’'C5x. The 
‘C5x supports external direct memory access of the on-chip single-access 
RAM block. For this reason, the following signals are now bidirectional: 


AOQ-A15 = address lines 
STRB = memory access strobe 
R/W = read/write 
BR = bus request 


The ’C5x serial port transmit clock (CLKX) can now be configured as an output 
that operates at one-fourth the machine clock rate. CLKX is configured as an 
input by reset. The C25 CLKX pin is always an input. 


The ’C25 operates with a four-phase clock. This device’s machine rate is 
one-fourth the CLKIN rate. CLKOUT1 and CLKOUT2 operate at the machine 
rate and are 90° out of phase. The 'C5x operates with a two-phase clock. The 
device’s machine rate is one-half the CLKIN rate. In addition, the ’C5x offers 
a divide-by-one clock input feature so that the device’s machine rate equals 
the CLKIN rate. CLKOUT1 operates at the machine rate. Figure D—4 shows 
both the ’C25 and the 'C5x clocking schemes. 


Figure D—4.’C25 and ’C5x Clocking Schemes 
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Package and Pin Layout 


The ’C5x MP/MC (microprocessor/microcomputer) pin is sampled only while 
RS is low. Changes on this pin are ignored while RS is high. The mode can be 
changed during execution by changing the MP/MC bit in the PMST register. 
On the ’C25, any change on the MP/MC pin affects the operation of the device, 
regardless of the state of RS. 


The ’C5x IACK signal goes low only on the first machine cycle of the fetch of 
the first word of the interrupt vector. The ‘C25 IACK goes low on each 
wait-state cycle, as well as on the first machine cycle, but it is valid only during 
CLKOUT1 low (during CLKOUT1 high, it has a specific meaning for emulator/ 
test operations). Figure D—5 illustrates this difference. 


The ’C5x device includes some additional functions not included with the ’C25. 
These functions and associated pins are as follows: 


TDM serial port = TCLKR, TCLKX, TDR, TDX, TADD, TFRM 

Emulation interface = EMUO, EMU1/OFF, IAQ, TCK, TDI, TDO, TMS, 
TRST 

Timer borrow = TOUT 

Divide-by-one clock = CLKIN2, CLKMD1, and CLKMD2 

Fourth external interrupt = INT4 

Nonmaskable interrupt = NMI 

Read enable = RD 

Write enable = WE 


OOOOUU OU 


The ’C5x package also includes 12 additional power and 13 additional ground 
pins. These additional power and ground pins enable the device to operate at 
much faster speeds. Twenty pins are reserved for future ’C5x spinoff devices. 


Figure D—5.’C25 IACK Versus ’C5x IACK 
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D.2 Timing 


The ’C25 and the ’C5x operate with some timing differences. These timing dif- 
ferences include aspects of the on-chip operation as well as aspects of the ex- 
ternal memory interfacing. One key difference is that the 'C5x is capable of op- 
erating at two to three times the speed of a’C25. Another key difference is that 
the 'C25 operates with a three-deep pipeline, while the ’C5x operates with a 
four-deep pipeline. Key differences in the external memory interface encom- 
pass the faster 'C5x and include certain external interface enhancements. The 
final key difference is that some compatible operations execute in a different 
number of machine cycles. This section describes these differences. 


D.2.1 Device Clock Speed 


D.2.2 Pipeline 


The ’C25 operates its machine cycles with a divide-by-four clocking scheme. 
The ’C5x uses a divide-by-two clocking scheme. This means that a’C25, oper- 
ating with a 40-MHz CLKIN, executes its machine cycles within 100 ns, while 
the 'C5x, which is operating with the same CLKIN, executes its machine cycles 
in 50 ns. This clocking arrangement changes the way that the signals of the 
devices are specified. Many of the ‘C25 timing values, given in the TMS320 
Second-Generation Digital Signal Processor Data Sheet, are specified as 
quarter-phase (Q) + N ns. The timing values of the 'C5x are defined in 
half-phases (H). 


The ’C25 operates with a three-deep pipeline, while the ’C5x operates with a 
four-deep pipeline. This means that anytime there is a program counter (PC) 
discontinuity (for example, branch, call, return, interrupt, etc.), it takes four 
cycles to complete with the 'C5x, whereas it takes three cycles on the C25. 
The ’C5x, however, also has delayed instructions that take only two cycles to 
complete. 


D.2.3 External Memory Interfacing 


The ’C5x is designed to execute external memory operations with the same 
signals as the ’C25. As mentioned above, the ’C5x operates at twice the instruc- 
tion rate of the C25 when both operate with the same input clock. The ’'C5x 
uses its software wait-state generators to compensate for this interface differ- 
ence. The ’C5x device, operating with one software wait state, has similar 
memory timing to the ‘C25 operating with no wait states. However, external 
writes require two cycles on the ’C5x devices. The exact timing of the signals 
differ because of the more advanced process used with the ’C5x. 


The ’C5x has two additional memory interface signals to reduce the amount 
of external interfacing circuitries. The RD signal can be used to interface direct- 
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ly to the output enable pin of another device, while the WE signal can be directly 
connected to the write enable pin of another device. This alleviates the need 
of gating STRB and R/W to generate the equivalent signals. 


D.2.4 Execution Cycle Times 
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Some of the ’C25 instructions require additional cycles or program words to 
execute on the ’C5x. The function of these instructions is the same, but the for- 
mat and pipeline execution are enhanced to operate with the 'C5x architec- 
ture. 


The IN and OUT instructions are now two-word instructions. They execute on 
the ’C5x in the same number of cycles as with the ’'C25, but the assembler gen- 
erates a two-word instruction for the 'C5x. Note that the ‘C5x IN and OUT 
instructions behave differently in RPT mode. See Chapter 4 for details. Two 
words are used because the ’C5x can address 65,536 I/O ports; the 'C25 ad- 
dresses 16. The ’C5x can address sixteen of its I/O ports in data memory 
space. This allows any instruction with data-memory-addressing capability to 
also read or write directly to an I/O port instead of having to pass it through a 
temporary on-chip data memory location. For example, a value can be read 
directly from an external analog-to-digital converter into the ALU via an I/O 
port. 


The modification of the three mode bits of the serial port are executed in 
two-cycle/two-word instructions with the 'C5x. However, any or ail of three bits 
can be modified with one instruction without affecting other bits in the register. 
This is done with the PLU instructions. 


The NORM instruction modifies the auxiliary register (AR) on the execute 
(fourth) phase of the pipeline, while the ARAU operations occur on the decode 
(second) phase. The two instructions following a NORM instruction should not 
use the same auxiliary register for an address. If the two instructions following 
NORM change the auxiliary register pointer (ARP), then the NORM update of 
the AR is executed on the new ARP, not the old one. See Chapter 4 for NORM 
instruction description. The assembler supports an optional way to test for this 
condition and automatically compensate by adding NOP instructions to the 
code. This modification is made to the listing and object files and does not af- 
fect your source code. 


Unlike the ’C25, the auxiliary registers are also accessible in the data address 
space on the ’C5x. This allows these registers to be loaded with the CALU 
instructions for advanced-addressing modes. However, take care when using 
this feature because the CALU operations write to the auxiliary registers on the 
execute phase of the pipeline and, therefore, are subject to the same charac- 
teristics of the NORM instruction. The assembler supports the option to flag 
these conflicts for resolution. 
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D.3 Instruction Set 


The ’C5x instruction set is a superset of the ’C25 instruction set. The instruction 
set of the ’C25 is upward source-code compatible. This means that all of the 
instruction features of the 'C25, implemented and code written for the ’C25, 
can be reassembled to run on the ’C5x. 


The serial port mode control bits have been moved from the status registers 
to the serial port control register. Because they are no longer part of the CPU 
registers, they no longer have direct instructions to set or clear them. The bits 
of the SPC can be manipulated easily with the PLU instructions. The following 
table shows the instructions used to replace the serial port instructions (note 
that the data page pointer must be set to zero to execute these new instruc- 
tions): 


#0FFF7h,SPC 
#8,SPC 


#OFFDFh,SPC 
#020h,SPC 
#0FFFBh,SPC 
#4,SPC 


Note that any or all three bits can be set in one execution of the OPL instruction, 
while any or all three bits can be cleared using the APL. The bits can be toggled 
with the XPL instruction. The I/O ports of the device are addressable in data 
memory space on the ’C5x devices. This means any instruction that can ad- 
dress data memory can also address the I/O ports. 


There are a number of new instructions on the ’C5x devices. These instruc- 
tions provide a more orthogonal addressing scheme and exercise the new 
CPU enhancements. To simplify the description of the instruction set, a num- 
ber of different instructions are combined into single new instructions with ad- 
ditional operand formats, as in this example: 


ADD *+ ADD *+ 


ADDK  OFFh ADD #0FFh 
ADLK OFFFFh ADD #OFFFFh 
ADDH  *+ ADD *4,.16 


Refer to Chapter 4 for the detailed discussion of the instruction set. 


The IDLE instruction, when executed, stops the CPU from fetching and ex- 
ecuting instructions until an unmasked interrupt occurs. The ’'C25 automatical- 
ly enables the interrupts globally with the execution of the IDLE instruction; this 
saves the extra instruction word/cycle required to execute the EINT (enable 
interrupts globally) instruction. Upon receipt of the interrupt, the "C25 executes 
the interrupt vector and resumes operations. The 'C5x does not automatically 
enable the interrupts globally with its IDLE instruction. If the interrupts are not 
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globally enabled, then the CPU resumes execution with the instructions follow- 
ing the IDLE instruction, without taking the interrupt trap. If the interrupts are 
globally enabled, the ’C5x operates like the 'C25. In addition, a second low- 
power mode is available with IDLE2 instruction. This mode operates the same 
as IDLE except that the CPU will resume only after an external interrupt. See 
Chapter 4 for IDLE/IDLE2 instruction details. 


The ’C5x repeat counter is 16 bits wide (the 'C25 repeat counter is 8 bits wide). 
This means that, when loading from RAM, the RPT instruction supports repeat 
counts up to 65,536. The assembler allows the RPT to support a16-bit immedi- 
ate repeat count also. Note that RPT with long immediate addressing is, how- 
ever, a two-word instruction. 
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D.4 On-Chip Peripheral Interfacing 


The ’C5x has more peripherals than the ’C25; many ’C5x peripherals are en- 
hancements of the ’'C25 peripherals. The ’'C25 has three peripheral circuits: 
serial port, timer, and 16 I/O ports. In addition to these peripherals, the ’C5x 
has software wait states and a divide-by-one clock. 


The serial port of the ’C5x has been enhanced in that the CLKX pin can be con- 
figured as either an input or an output (CLKX is always an input on the ’C25). 
CLKX is configured as an input upon a device reset to maintain compatibility 
with the ’C25. The new serial port status bits are now mapped to a 
memory-mapped register that is used exclusively for the serial port. The serial 
port modes are no longer controlled via status register 1. Therefore, serial port 
modes that are changed by using LST1 instruction will no longer work. The 
mode bits must be set/reset via the serial port control register (SPC). The data 
transmit (DXR) and data receive (DRR) registers have been moved in the 
memory map from locations 1 and 0 to 33 and 32, respectively. 


The timer has been enhanced on the ’C5x to include a divide-down factor of 
1 to 17 and can be stopped or reset via software. These additional features are 
controlled via the timer control register (TCR). Upon reset, the divide-down 
factor is set to 1, and the timer is enabled to maintain compatibility with the 
‘C25. The timer (TIM) and period (PRD) registers have been moved in the 
memory map from locations 2 and 3 to locations 36 and 37, respectively. 


The 16 input/output ports of the 'C5x are addressable in the data memory 
space. This allows direct access of the I/O space by the core CPU and sup- 
ports bit operation in the |/O space via the PLU. The |/O space is increased 
from 16 ports to 65,536 ports. However, no additional decode circuitry is nec- 
essary if only 16 ports are used. 


The ’C5x includes software wait-state generators that are mapped on 
16K-word page sizes in the program and data memory spaces. There are also 
wait-state generators for the I/O ports. The I/O space wait-state generators 
can be mapped on two-word or on 8K-word boundaries. These wait-state gen- 
erators allow the system to be programmed for 0, 1, 2, 3, 4, or 7 wait states, 
eliminating the need of an off-chip interfacing circuitry. External access wait 
states can be extended further via the READY signal. 
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The ’C5x DSPs support emulation through a dedicated emulation port. The 
emulation port is a superset of the IEEE 1149.1 (JTAG) standard and can be 
accessed by the XDS510 emulator. For details on the JTAG protocol, refer to 
the IEEE 1149.1 specification. The information in this appendix supports 
XDS510 Cable #2563988-001 Rev B. 


This appendix contains the following sections 
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E.1 Cable Header and Signals 


To perform emulation with the XDS510, your target system must have a 14-pin 
header (two 7-pin rows) with connections as shown in Figure E-1. Table E—1 
describes the emulation signals. 


Although you can use other headers, recommended parts include: 


Straight header, unshrouded DuPont Electronics® part number 
67996-—114 

Right-angle header, unshrouded DuPont Electronics® part number 
68405-114 


Figure E—1. Header Signals and Header Dimensions 


TMS TRST 
TDI GND Header Dimensions: 
PD (+5 V) No pin (key) Pin-to-pin spacing: 0.100 in. (X,Y) 
TDO GND Pin width: 0.025 in. square post 
TCK_RET GND Pin length: 0.235 in., nominal 
TCK GND 
EMUO EMU1 


JTAG test clock. TCK is a 10-MHz clock source 
from the emulation cable pod. This signal can be 
used to drive the system test clock. 


a 


EMUO Tt | vO _| Emulation pin. 
EMU! CLO _| Emulation pint 
cable is connected and that the target is powered 
up. PD should be tied to +5 volts in the target sys- 
JTAG test clock return. Test clock input to the 
TCK_RET XDS510 emulator. May be a buffered or unbuf- 
| fered version of TCK. 
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TCK 


| © | STAG test data output. 


Presence detect. Indicates that the emulation 
tem. 


E.2 Bus Protocol 


The IEEE 1149.1 specification covers the requirements for JTAG bus slave de- 
vices ('C5x) and provides certain rules. Those rules are summarized as fol- 
lows: 


[_} The TMS/TDI inputs are sampled on the rising edge of the TCK signal of 
the device. 

[_} The TDO output is clocked from the falling edge of the TCK signal of the 
device. 


When JTAG devices are daisy-chained together, the TDO of one device has 
approximately a half TCK cycle set up to the next device’s TDI signal. This type 
of timing scheme minimizes race conditions that would occur if both TDO and 
TDI were timed from the same TCK edge. The penalty for this timing scheme 
is a reduced TCK frequency. 


The IEEE 1149.1 specification does not provide rules for JTAG bus master 
(XDS510) devices. Instead, it states that it expects a bus master to provide bus 
slave compatible timings. The XDS510 provides timings that meet the bus 
slave rules and also provides an optional timing mode that allows you to run 
the emulation at a much higher frequency for improved performance. 
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Figure E—2 shows a portion of the XDS510 emulator cable pod. These are the 
functional features of the emulator pod: 


[-} Signals TDO and TCK_RET can be parallel-terminated inside the pod if 
required by the application. The default is that these signals are not termi- 
nated. 


[Cj Signal TCKis driven with a 74AS1034 device. Because of the high current 
drive (48 mA Ic, /lon), this signal can be parallel-terminated. If TCK is tied 
to TCK_RET, then you can use the parallel terminator in the pod. 


[-} Signals TMS and TDi can be generated from the falling edge of TCK_RET, 
according to the IEEE 1149.1 bus slave device timing rules. They can also 
be driven from the rising edge of TCK_RET, which allows a higher 
TCK_RET frequency. The default is to match the IEEE 1149.1 slave de- 
vice timing rules. This is an emulator software option that can be selected 
when the emulator is invoked. In general, single-processor applications 
can benefit from the higher clock frequency. However, in multiprocessing 
applications, you may wish to use the IEEE 1149.1 bus slave timing mode 
to minimize emulation system timing constraints. 


[) Signals TMS and TDI are series-terminated to reduce signal reflections. 


L} A 10-MHz test clock source is provided. You may also provide your own 
test clock for greater flexibility. 
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Figure E—2. Emulator Pod Interface 
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GND 
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EMUO (Pin 13) 
EMU1 (Pin 13 


TCK_RET (Pin 9) 
PD (Pin 5) 


33 Q 


TMS (Pin 1) 
74AS258 ie 
TDI (Pin 3) 


74AS1034 
TCK (Pin 11) 


74AS1034 
74AS1034 
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TRST (Pin 2) 


74AS1004 


-,° 


CL 


Figure E—3 and Table E—2 show the signal timings for the XDS510. Timing pa- 
rameters are calculated from standard data sheet parts used in the cable pod. 
These timings are for reference only. Texas Instruments does not test or guar- 


antee these timings. 


The emulator pod uses TCK_RET as its clock source for internal synchroniza- 
tion. TCK is provided as an optional target system test clock source. 
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Figure E—3. Emulator Pod Timings 
TCK_RET 


TMS TDI (Default) 


eS 


TMS TDI (Optional) 


Table E—2.Emulator Pod Timing Parameters 
es Reference Description | | | Min Max | 


TCKmax 


TCK_RET high pulse duration 
TCK_ RET low pulse duration 


BE td OCTMXmin) TMS/TDI valid from TCK_RET low (default timing) a a 
MXmax 

js | Meow TMS/TDI valid from TCK_RET high (optional timing) = 
d)XTMSmax 


| tsupcrDomin) __| TDO setup time to TCK_RET high 
TDO hold time from TCK_RET high 


It is extremely important to provide high-quality signals between the emulator 
and the target processor. If the distance between the emulation header and 
the processor is greater than 6 inches, the emulation signals should be buff- 
ered. Sections E.4 and E.5 illustrate typical connections between the target 
processor and the emulation header. 
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E.4 Target System Test Clock 


Figure E—4 shows an application with the system test clock generated in the 
target system. In this application the TCK signal is left unconnected. 


Figure E-4. Target-System Generated Test Clock 


Greater Than 
6 Inches 


Emulator Header 


EMUO 
EMU1 


| System Test Clock 


There are two benefits to having the target system generate the test clock: 


1) You can set the test clock frequency to match your system requirements. 
The emulator provides only a single 10-MHz test clock. 


2) You may have other devices in your system that require a test clock when 
the emulator is not connected. 


Multiprocessor Configuration 


E.5 Multiprocessor Configuration 


Figure E—5. Multiprocessor Connections 
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Figure E—5 shows a typical multiprocessor configuration. This is a daisy- 
chained configuration (TDO-TDI daisy-chained), which meets the minimum 
requirements of the IEEE 1149.1 specification. The emulation signals in this 
example are buffered to isolate the processors from the emulator and provide 
adequate signal drive for the target system. One of the benefits of a JTAG test 
interface is that you can generally slow down the test clock to eliminate timing 
problems. Several key poirits to multiprocessor support are as follows: 


[.} The processor TMS, TDI, TDO, and TCK should be buffered through the 
same physical package to control timing skew better. 


[-} The input buffers for TMS, TDI, and TCK should have pullups to 5 volts. 
This will hold these signals at a known value when the emulator is not con- 
nected. A puilup of 4.7 kQ or greater is suggested. 


[_} Buffering EMUO and EMU1 is optional, but highly recommended to pro- 
vide isolation. These are not critical signals and do not need to be buffered 
through the same physical package as TMS, TCK, TDI, and TDO. Buff- 
ered and unbuffered signals are shown in Figure E—6 and Figure E—7. 


No signal buffering. In this situation, the distance between the header and 
the processor should be no more than 6 inches. 
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Figure E—6. Unbuffered Signals 


Emulation signals buffered. The distance between the emulation header 
and the processor is greater than 6 inches. The emulation signals —TMS, TDI, 
TDO, and TCK_RET— are buffered through the same package. 


Figure E—7. Buffered Signals 


nae More than 6 in a 
+5 V 


Emulator Header 


EMUO 
EMU1 


[} The EMUO and EMU1 signals must have pullups to 5 volts. The pullup re- 
sistor value should be chosen to provide a signal rise time less than 10 us. 
A 4.7-kQ resistor is suggested for most applications. EMUO—1 are I/O pins 
on the ’C4X and ’C5X; however, they are only inputs to the XDS510. In 
general, these pins are used in multiprocessor systems to provide global 
run/stop operations. 
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a) It is extremely important to provide high quality signals, especially on the 
processor TCK and the emulator TCK_RET signal. In some cases, this 
may require you to provide special PWB trace routing and to use termina- 
tion resistors to match the trace impedance. The emulator pod does pro- 
vide optional internal parallel terminators on the TCK_RET and TDO. TMS 
and TDI provide fixed series termination. 
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E.6 Emulation Timing Calculations 


The following are a few examples on how to calculate the emulation timings 
in your system. For actual target timing parameters, see the appropriate de- 
vice data sheets. 


Assumptions: 


tsu(TTMS) 


th(TTMS) 
td(TTDO) 


td (bufmax) 


td(bufmin) 


t(bufskew) 


lickfactor 


Target TMS/TDI setup to TCK high 
Target TMS/TDI hold from TCK high 
Target TDO delay from TCK low 
Target buffer delay maximum 
Target buffer delay minimum 


Target buffer skew between two devices 
in the same package: 
[ta(oufmax) — ta(butminy] x 0.15 


Assume a 40/60 duty cycle clock 


Given in Table E-2 (page E-6): 
taocrMSmax) XDS510 TMS/TDI delay from TCK_RET 


td(XTMX) 


low, maximum 


min XDS510 TMS/TDI delay from 
TCK_RET low, minimum 


taxTMSmax) %XDS510 TMS/TDI delay from TCK_RET 


td(xTMXmin) 


high, max 


XDS510 TMS/TDI delay from TCK_RET 
high, minimum 


tsu(xTDOmin) TDO setup time to XDS510 TCK_RET 


There are two key timing paths to consider in the emulation design: 
(1) the TCK_RET/TMS/TDI (tordtck_ TMs) Path, and 
(2) the TCK_RET/TDO (tpratck_ Too) path. 


high 


10 ns 
5 ns 
15 ns 
10 ns 


1 ns 


1.35 ns 
0.4 


20 ns 


6 ns 


24ns 


7ns 


3ns 


In each case, the worst case path delay is calculated to determine the maxi- 
mum system test clock frequency. 
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Case 1: 


Case 2: 


Case 3: 
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Single processor, direct connection, TMS/TDI timed from TCK_RET low (de- 
fault timing). 


[t(d¢xTMSmax) + tsucrTms)] / teckfactor 
(20 ns + 10 ns) /0.4 
75 ns (13.3 MHz) 


tordtck_TMS 


[t(dcrTDO) + tsucxTDOmin)] / tickfactor 
(15 ns + 3 ns) /0.4 
45 ns (22.2 MHz) 


In this case, the TCK/TMS path is the limiting factor. 


tordtck_TDO 


Single processor, direct connection, TMS/TDI timed from TCK_RET high (op- 
tional timing). 


td(xTMSmax) + tsu(TTMS) 
(24 ns + 10 ns) 
34 ns (29.4 MHz) 


tordtck_TMS 


[tart po) + tsucxTDOmin)] / tickfactor 
(15 + 3) /0.4 


45 ns (22.2 MHz) 


tordtck_TDO 


In this case, the TCK/TDO path is the limiting factor. One other thing to consid- 
er in this case is the TMS/TDI hold time. The minimum hold time for the 
XDS510 cable pod is 7 ns, which meets the 5-ns hold time of the target device. 


Single/multiple processor, TMS/TDI buffered input; TCK_RET/TDO buffered 
output, TMS/TDI timed from TCK_RET high (optional timing). 


tordtck_TMS taaxTMSmax) + tsucrTms) + 2ta(bufmax) 
24 ns + 10 ns + 2 (10) 


54 ns (18.5 MHz) 


tordtck TDO == [ta¢rtD0) + tsuxTDOmin) + t(bufskew)] / ttekfactor 
(15 ns + 3ns + 1.35 ns) /0.4 


58.4ns (20.7 MHz) 


In this case, the TCK/TMS path is the limiting factor. The hold time on TMS/TDI 
is also reduced by the buffer skew (1.35 ns) but still meets the minimum device 
hold time. 


XDS510 Design Considerations 


Case 4: 


Emulation Timing Calculations 


Single/multiprocessor, TMS/TDI/TCK buffered input; TDO buffered output, 
TMS/TDI timed from TCK_RET low (default timing). 


tordtck TMS = [taaTMSmax) + tsucTTMs) + toufskewl / tekfactor 
= (24ns + 10 ns + 1.35 ns) / 0.4 


= 88.4ns (11.3 MHz) 


tordtck TOO }=- =_ (tacrTD0) + tsuxTDOmin) + ta(butmax)] / tekfactor 
= (15ns+3ns+10ns)/0.4 
= 70 ns (14.3 MHz) 


In this case, the TCK/TMS path is the limiting factor. 


In amultiprocessor application, itis necessary to ensure that the EUMO—1 lines 
can go from a logic low level to a logic high level in less than 10 ws. This can be 
calculated as follows (remember that t = 5 RC): 


5(Rpullup x Ngevices * Cload _per_device) 
5(4.7kQ x 16 x 15pF) 


5.64 us 


trise 
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F-14 XDS510 Design Considerations 


Appendix F 


Analog Interface Peripherals and 
Applications 
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Texas Instruments offers many products for total system solutions, including 
memory options, data acquisition, and analog input/output devices. This ap- 
pendix describes a variety of devices that interface directly to the TMS320 
DSPs in rapidly expanding applications. 


Topic Page 


F.1 Multimedia Applications 


Multimedia integrates different media through a centralized computer. These 
media can be visual or audio and can be input to or output from the central 
computer via a number of technologies. The technologies can be digital based 
or analog based (such as audio or video tape recorders). The integration and 
interaction of media enhances the transfer of information and can accommo- 
date both analysis of problems and synthesis of solutions. 


Figure F—1 shows both the central role of the multimedia computer and the 
multimedia system's ability to integrate the various media to optimize informa- 
tion flow and processing. 


Figure F—1.System Block Diagram 
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System Design Considerations 


Multimedia systems can include various grades of audio and video quality. The 
most popular video standard currently used (VGA) covers 640 x 480 pixels 
with 1, 2, 4, and 8-bit memory-mapped color. Also, 24-bit true color is sup- 
ported, and 1024 x 768 (beyond VGA) resolution has emerged. There are two 
grades of audio. The lower grade accommodates 11.25-kKHz sampling for 8-bit 
monaural systems, while the higher grade accommodates 44.1-kHz sampling 
for 16-bit stereo. 


Audio specifications include a musical instrument digital interface (MIDI) with 
compression capability, which is based on keystroke encoding, and an input/ 
output port with a 3-disc voice synthesizer. In the media control area, video 
disc, CD audio, and CD ROM player interfaces are included. Figure F—2 
shows a multimedia subsystem. 


Analog Interface Peripherals and Applications 


Multimedia Applications 


The TLC32047 wide-band analog interface circuit (AIC) is well suited for multi- 
media applications because it features wide-band audio and up to 25-kHz 
sampling rates. The TLC32047 is a complete analog-to-digital and digital-to- 
analog interface system for the TMS320 DSPs. The nominal bandwidths of the 
filters accommodate 11.4 kHz, and this bandwidth is programmable. The 
application circuit shown in Figure F—2 handles both speech encoding and 
modem communication functions, which are associated with multimedia appli- 
cations. 


Figure F—2. Multimedia Speech Encoding and Modem Communication 
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Figure F—3 shows the interfacing of the C25 DSP to the TLC32047 AIC that 
constitutes the building blocks of the 9600-bps V.32 bis modem shown in 
Figure F—2. 


Figure F—3. TMS320C25 to TLC32047 Interface 
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F.1.2 Multimedia-Related Devices 


As shown in Table F—1, Tl provides a complete array of analog and graphics 
interface devices. These devices support the TMS320 DSPs for complete mul- 
Table F—1.Data Converter ICs 


timedia solutions. 
Zz Resolu- | ,_ | | 
; Conversion 
Device Description aa CLK Rate Application 


| Analog interface (5 V Portable modem and 
Analog interface Speech, modem, and 
TLOS2047 | (11.4-kHz BW) (AIC) ae 29 KHZ | multimedia 


25 kHz | Speech and modems 
Analog interface (AIC) 


roses 
Tvideopalete | Parater| “nies | _195Mez | Graphics 


TLO34058 


19.2 kHz | Speech and modems 
Serial | 14 | 19.2 kHz | Speech and modems: 


TLO5502/3 


TLO5602 | Video DAC Parallel 


‘TLC32046 


analog interac (alc) | Sarai | 14 
4 
01 | Flash Dc ——~—~*d Para | 6 | 20M | video 


14 
14 
14 
7 io 
7 14 | 
TLC5501 sh ADC | | 
TLC5601 | Video DAC Parallel| 6 | | 20MHz 
| tucissoi1 | ADC ———_| Parallel 150 KHz | Servo ctrl / speech 
TLC32071 Analog interface (AIC) Parallel | 8 fA MHz Servo ctrl / disk drive 


| Dual audio DAC+ digital 32, 37.8, | 4... 
IMSo701si8 filter 16/18 44.1, 48 kHz Digital aul - 


Table F—2. Switched-Capacitor Filter ICs 


[“pevics [Function 
TLC2470 Differential audio filter amplifier | 
TLC2471 | Differential audio filter amplifier 3.5kHz | 500mW | Yes 
= j N 


CLK +50 | 
TLC10/20 alia purpose dual filter | CLK + 100 a ae 
CLK + 50 
TLCO4/14 Low pass, Butterworth filter CLK + 100 re a 


For application assistance or additional information, please call Tl Linear 
Applications at (214) 997-3772. 


Roll-on 
soomw [Yes 
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F.2 Telecommunications Applications 


The TI linear product line focuses on three primary telecommunications appli- 
cation areas: subscriber instruments (telephones, modems, etc.), central of- 
fice line card products, and personal communications. Subscriber instruments 
include the TCM508x DTMF tone encoder family, the TCM150x tone ringer 
family, the TCM1520 ring detector, and the TCM3105 FSK modem. Central of- 
fice line card products include the TCM29Cxx combo (combined PCM filter 
plus codec) family, the TCM420x subscriber line control circuit family, and the 
TCM1030/60 line card transient protector. Personal communication (PCN) 
and cellular products include the TCM320AC3x family of 5-volt voice-band au- 
dio processors (VBAP). 


Tl continues to develop new telecom integrated circuits, such as a high-perfor- 
mance 3-volt combo family for personal communications applications, and an 
RF power amplifier family for hand-held and mobile cellular phones. 


System Design Considerations. The size, network complexity, and com- 
patibility requirements of telecommunications central office systems create 
demanding performance requirements. Combo voice-band filter performance 
is typically + 0.15 dB in the passband. Idle channel noise must be on the order 
of 15 dBrncO. Gain tracking (S/Q) and distortion must also meet stringent re- 
quirements. The key parameters for a SLIC device are gain, longitudinal bal- 
ance, and return loss. 


Telecommunications Applications 


Figure F—4. Typical DSP/Combo Interface 
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The TCM320AC36 combo interfaces directly to the 'C25 serial port with a mini- 
mum of external components, as shown in Figure F—4. Half of hex inverter U3 
and crystal Y1 form an oscillator that provides clock timing to the 
TCM320AC36. The synchronous 4-bit counters U1 and U2 generate an 8-kHz 
frame sync signal. DCLKR on the TCM320AC36 is connected to Vpp, placing 
the combo in fixed data-rate mode. Two 20-kQ resistors connected to ANLGIN 
and MIC_GS set the gain of the analog input amplifier to 1. The timing is shown 
in Figure F—5. 
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Figure F—5. DSP/Combo Interface Timing 
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Telecommunications-Related Devices. Data sheets for the devices in 
Table F-3 are contained in the 1991 Telecommunications Circuits Databook, 
(literature number SCTDO01). To request your copy, contact your nearest Tex- 
as Instruments field sales office. 


For further information on these telecommunications products, please call TI 
Linear Applications at (214) 997-3772. 


Telecommunications Applications 


Table F—3. Telecom Devices 


Po a Coding Clock Rates 5 | | hes 
Codec/Filter | 


| TCM29C13 _ 1.544, 1.536, 2.048 
TCM29C14 Aand u 1.544, 1.536, 2.048 


C.O. and PBX line cards _ 


Includes 8th-bit signal 


rT 
A 
UL 
pd 
A 


p20 | 
P2048 


TCM29C16 
TCM29C17 
TCM29C18 
~ TCM29C19 
| TCM29C23 _ 
TCM29C26 
TCM320AC36 
~ TCM320AC37 
| TCM320AC38 
TCM320AC39 


16-pin package 


2.048 16-pin package 


| _ Low-cost DSP interface | 
Aand u 


Up to 4.096 


Extended frequency range | 


Low-power TCM29C23 : 


Single voltage (+5) VBAP 


| 8 | 
peal 
| 2048] Low-cost DSP interface | 
bee! 
_ a 
8 
8 and 13° 


| Up to 4.096 8 and 13 | Single voltage (+5) VBAP | 
Up to 4.096 8 and 13 | Single voltage (+5) GSM 
Up to 4.096 8 and 13 | Single voltage (+5) GSM | 


| TP3054/64 


National Semiconductor 


TP3054/67 second source 


| TLC320ACO1_ | Linear 43.2 KHz 


TLO32040/1 | Linear Up to 19.2-kHz sampling 
TLC32044/5 | Linear Up to 19.2-kHz sampling 
| Linear Up to 25-kHz sampling 


1 5-volt-only analog interface 


For high-dynamic linearity _| 


For high-dynamic linearity 7 


National Semiconductor 


TLC32046 | 


TLC32047 Up to 25-kHz sampling 


Transient Suppressor 


| | For high-dynamic linearity 


14 
14 
14 
14 


For high-dynamic linearity 


_TCM1030__ 
_TCM'1060 


T Unless otherwise noted 


Transient suppressor for SLIC-based line card (30 A max) | 


___ Transient suppressor for SLIC-based line card (60 A max) 


Table F—4. Switched-Capacitor Filter ICs 


Device [Function [Order | Rolon 
TLC2470 Differential audio filter amplifier | s00mw | Yes | 


C4 Yes 
TLC2471 | Differential audio filter amplifier | 4 | 3.5kHz | 500 mW 
| TLC10/20 General-purpose dual filter i) a . rae 
TLCO4/14 | Low pass, Butterworth filter ee ae N a | wa | Ne 
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[ Power Out | 


Figure F—6. General Telecom Applications 
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Figure F—7. Generic Telecom Application 
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Dedicated Speech Synthesis Applications 


F.3 Dedicated Speech Synthesis Applications 


For dedicated speech synthesis applications, Texas Instruments offers a fami- 
ly of dedicated speech synthesizer chips. This speech technology has been 
used in a wide range of products including games, toys, burglar alarms, fire 
alarms, automobiles, airplanes, answering machines, voice mail, industrial 
control machines, office machines, advertisements, novelty items, exercise 
machines, and learning aids. 


Dedicated speech synthesis chips are effective in low-cost applications. The 
speech synthesis technology provided by the dedicated chips is either LPC (li- 
near-predictive coding) or CVSD (continuously variable slope delta modula- 
tion). Table F—5 shows the characteristics of the Tl voice synthesizers. 


Table F—5. Voice Synthesizers 


TI Voice Synthesizers: 


7 | | Synthesis On-Chip External Data Rate | 
famed | YP | woman ie eae 
TsPs0c4x | 8-bit | ~— LPC-10 20/32 64K/128K VROM | 1200-2400 
| tspsocix | abt =| Lpci2 | 10 | 6aKi28K | VROM | 1200-2400 


| upcto | 20 | NA ___| FromhostyP | 1200-2400 
| N/A 


| Tsps3c3o_ | stit 
TSP50C20_ tpc-io_ | 32 | NA | EPROM__| 1200-2400 


| ™s3477,_ | ~~=NA | ___CVSD | None | DRAM 16K-32K 


TI has low-cost memories that are ideal for use with speech synthesizers 
chips. Texas Instruments can also be of assistance in developing and process- 
ing the speech data that is used in these speech synthesis systems. Table F-6 
shows speech memory devices of different capabilities. Additionally, audio fil- 
ters are outlined in Table F—7. 


Table F—6.Speech Memories 


TSP60Cxx Family of Speech ROMs 
TSP60C18 TSP60C19 TSP60C20 TSP60C80 TSP60C81 
ae es 


NoofPins [16 | 8 


| Interface Parallel 4-bit Bi ele Parallel 4-bit 


| Foruse with: | — TSP50C1x TSP50C4x TSP50C4x TSP50C4x TSP50C1x 
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Dedicated Speech Synthesis Applications 


Table F—7. Switched-Capacitor Filter ICs 


[Device | Function [Order | RollLOf | Power Out | Power Down 
TLC2470 | Differential audio filter amplifier 
i Yes 


TLC2471 | Differential audio filter amplifier | 4 | 3.5kHz | 500mwW | = Yes | 


CLK + 50 
CLK + 50 
ee CLK + 100 wa fe 


Speech Synthesis Development Tools 


Software: System: 
EVM Code development tool SEB System emulator board 
Speech: SEB60Cxx System emulator boards for speech 
SAB Speech audition board memories 
SD85000 ##PC-based speech analysis 
system 


For further information on these speech synthesis products, please call Tl Linear Applications at 
(214) 997-3772. 
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Servo Control/Disk Drive Applications 


F.4 Servo Control/Disk Drive Applications 


Several years ago, most servo control systems used only analog circuitry. 
However, the growth of digital signal processing has made digital control 
theory a reality. Figure F—8 shows a block diagram of a generic digital control 
system using a DSP, along with an ADC and DAC. 


Figure F—8. Generic Servo Control Loop 
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TMS320-Based 
Digital Controller 


In a DSP-based control system, the control algorithm is implemented via soft- 
ware. No component aging or temperature drift is associated with digital con- 


trol systems. Additionally, sophisticated algorithms can be implemented and 
easily modified to upgrade system performance. 


System Design Considerations. TMS320 DSPs have facilitated the de- 
velopment of high-speed digital servo control for disk drive and industrial con- 
trol applications. Disk drives have increased storage capacity from 5 mega- 
bytes to over 1 gigabyte in the past decade, which equates to a 23,900 percent 
growth in capacity. To accommodate these increasingly higher densities, the 
data on the servo platters, whether servo-positioning or actual storage infor- 
mation, must be converted to digital electronic signals at increasingly closer 
points in relation to the platter “pick-off” point. The ADC must have increasingly 
higher conversion rates and greater resolution to accommodate the increasing 
bandwidth requirements of higher storage densities. In addition, the ADC con- 
version rates must increase to accommodate the shorter data retrieval access 
time. 
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_bervo Control/Disk Drive Applications 
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Figure F—9 shows a block diagram of a disk drive control system. 


Figure F—9. Disk Drive Control System Block Diagram 
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Table F—8 lists analog/digital interface devices used for servo control. 


Table F-8. Control Related Devices 
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Figure F—10 shows the interfacing of the C14 and the TLC32071. 


Figure F-—10. TMS320C 14 — TLC32071 Interface 
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For further information on these servo control products, please call Tl Linear 
Applications at (214) 997-3772. 
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Modem Applications 


F.5 Modem Applications 


High-speed modems (9,600 bps and above) require a great deal of analog sig- 
nal processing in addition to digital signal processing. Designing both high- 
speed capabilities and slower fall-back modes poses significant engineering 
challenges. Tl offers a number of analog front-end (AFE) circuits to support 
various high-speed modem standards. 


The TLC32040, TLC32044, TLC32046, TLC32047, and TLC320AC01 analog 
interface circuits (AIC) are especially suited for modem applications by the in- 
tegration of an input multiplexer, switched capacitor filters, high resolution 
14-bit ADC and DAC, a four-mode serial port, and control and timing logic. 
These converters feature adjustable parameters, such as filtering characteris- 
tics, sampling rates, gain selection, (sin x)/x correction (TLC32044, 
TLC32046, and TLC32047 only), and phase adjustment. All these parameters 
are software programmable, making the AIC suitable for a variety of applica- 
tions. Table F—9 has the description and characteristics of these devices. 


Table F—9.Modem AFE Data Converters 


Device | Description 


TLos2044 
ruc3204e 


TCM320AC36 | Single-supply codec/filter 


TLC32047. ‘| AlC with 11.4-kHzBW 


The AIC interfaces directly with serial-input TMS320 DSPs, which execute the 
modem’s high-speed encoding and decoding algorithms. The TLC3204x fami- 
ly performs level-shifting, filtering, and A/D and D/A data conversion. The 
DSP’s many software-programmable features provide the flexibility required 
for modem operations and make it possible to modify and upgrade systems 
easily. Under DSP control, the AlC’s sampling rates permit designers to in- 
clude fall-back modes without additional analog hardware in most cases. 
Phase adjustments can be made in real time so that the A/D and D/A conver- 
sions can be synchronized with the upcoming signal. In addition, the chip has 
a built-in loopback feature to support modem self-test requirements. 
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Modem Applications 


For further information or application assistance, please call T! Linear Applica- 
tions at (214) 997-3772. 


Figure F—11. High-Speed V.32 Bis and Multistandard Modem With the TLC320AC01 AIC 
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Figure F-11 shows a V.32 bis modem implementation using the’C25 and a 
TLC320AC01. The upper ’C25 performs echo cancellation and transmit data 
functions, while the lower ’'C25 performs receive data and timing recovery 
functions. The echo canceler simulates the telephone channel and generates 
an estimated echo of the transmit data signal. The TLC320AC01 performs the 
following functions: 


Upper TLC320ACO1 D/A Path: Converts the estimated echo, as com- 
puted by the upper 'C25, into an analog 
signal, which is subtracted from the re- 
ceive signal. 


Upper TLC320AC01 A/D Path: Converts the residual echo to a digital sig- 
nal for purposes of monitoring the residu- 
al echo and continuously training the echo 
canceler for optimum performance. The 
converted signal is sent to the upper ’C25. 
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Modem Applications 


Lower TLC320AC01 D/A Path: Converts the upper ’C25 transmit output 
to an analog signal, performs asmoothing 
filter function, and drives the DAC. 


Lower TLC320AC01 D/A Path: Converts the echo-free receive signal to 
a digital signal, which is sent to the lower 
‘C25 to be decoded. 


a TE TE EES, | 


Note: 


The example in Figure F—11 is for illustration only. In reality, one single 'C5x 
DSP can implement high-speed modem functions. 


a a ee | 
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F.6 Advanced Digital Electronics Applications for Consumers 


With the extensive use of the TMS320 DSPs in consumer electronics, much 
electromechanical control and signal processing can be done in the digital do- 
main. Digital systems generally require some form of analog interface, usually 
in the form of high-performance ADCs and DACs. Figure F—12 shows the gen- 
eral performance requirements for a variety of applications. 


Figure F—12. Applications Performance Requirements 
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Advanced Television System Design Considerations. Advanced 
Digital Television (ADTV) is a technology that uses digital signal processing to 
enhance video and audio presentations and to reduce noise and ghosting. Be- 
cause of these DSP techniques, a variety of features can be implemented, in- 
cluding frame store, picture-in-picture, improved sound quality, and zoom. The 
bandwidth requirements remain at the existing 6-MHz television allocation. 
From the IF(intermediate frequency) output, the video signal is converted by 
an 8-bit video ADC. The digital output can be processed in the digital domain 
to provide noise reduction, interpolation or averaging for digitally increased 
sharpness, and higher quality audio. The DSP digital output is converted back 
to analog by a video DAC, as shown in Figure F—13. 
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Advanced Digital Electronics Applications for Consumers 


Figure F—13. Video Signal Processing Basic System 
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VCRs, compact disc and DAT players, and PCs are a few of the products that 
have taken a major position in the marketplace in the last ten years. The audio 
channels for compact disc and DAT require 16-bit A/D resolution to meet the 
distortion and noise standards. See Figure F—14 for a block diagram of a typi- 
cal digital audio system. 


Figure F—14. Typical Digital Audio Implementation 
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The motion and motor control systems usually use 8- to 10-bit ADCs for the 
lower frequency servo loop. Tape or disc systems use motor or motion control 
for proper positioning of the record or playback heads. With the storage me- 
dium compressing data into an increasingly smaller physical size, the position- 
ing systems require more precision. 


The audio processing becomes more demanding as higher fidelity is required. 
Better fidelity translates into lower noise and distortion in the output signal. 
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The TMS57013DW/57014DW 1-bit digital-to-analog converters (DAC) in- 
clude an 8 times over sampling digital filter designed for digital audio systems, 
such as CDPs, DATs, CDis, LDPs, digital amplifiers, car stereos, and BS tun- 
ers. They are also suitable for all systems that include digital sound processing 
like TVs, VCRs, musical instruments, NICAM systems, multimedia, etc. 


The converters have dual channels so that the right and left stereo signals can 
be transformed into analog signals with only one chip. There are some func- 
tions that allow the customers to select the conditions according to their appli- 
cations, such as muting, attenuation, de-emphasis, and zero data detection. 
These functions are controlled by external 16-bit serial data from a controller 
like a microcomputer. 


The TMS5703DW/57014DW adopt 129-tap FIR filter and third-order A = mod- 
ulation to get —75-dB stop band attenuation and 96-dB SNR. The output is 
PWM wave, which facilitates analog signal through a low-pass filter. 


Table F—10 lists Tl products for analog interfacing to digital systems. 


Table F—10.Audio/Video Analog/Digital Interface Devices 


Dual audio DAC+ digital filter | TMS57013/4 16/18 32, 37.8, 
44.1, 48 kHz 
Analog interface TLC32071 

8 2 us Parallel 
D/A 8 15 ys Parallel 


| TLO1225 pps ft Parallel 
aA 
Le al 
ae 


Interface 


0 


A/D 
| Parallel 


a 

nD a 
3s 

8 

se 

<a 


Parallel 


vileoBA 3 
tere | 8 | Para 
[TisletasnaD | Tuosros__[ 8 | _7ore | 8 | Paralel 


| Flash A/D 


ee 


For further information or application assistance, please call Tl Linear Applica- 
tions at (214) 997-3772. 
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Appendix G 


This appendix provides product information regarding memories and sockets 
that are manufactured by Texas Instruments and are compatible with the ’'C5x. 
Information is also given regarding crystal frequencies, specifications, and 
vendors. 


The contents of the major areas in this appendix are listed below. 


Topic Page 


G.1 Memories 


This section provides product information on EPROM memories that can be 
interfaced with ’C5x processors. Refer to Digital Signal Processing Applica- 
tions with the TMS320 Family for additional information on interfaces using 
memories and analog conversion devices. 


Data sheets for EPROM memories are located in the MOS Memory Data Book 
(literature number SMYDOO8). 


TMS27C64 

TMS27C128 
TMS27C256 
TMS27C512 


Another EPROM memory, TMS27C291/292, is described in a data sheet (liter- 
ature number SMLS291A). 


Memories, Sockets, and Crystals 


G.2 Sockets 


Sockets 
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AMP manufactures a 132-pin quad flat pack socket for the ’C5x devices. There 
are two pieces — a base (the socket itself) and a lid. The part numbers are 


Base AMP part number 821942-1 
Lid AMP part number 821949-5 


For additional information about TI sockets, contact the nearest TI sales office 
Or: 


Texas Instruments Incorporated 
Connector Systems Dept, M/S 14-3 
Attleboro, MA 02703 

(617) 699-5242/5269 

Telex: 92-7708 
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G.3 Crystals 
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This section lists the commonly used crystal frequencies, crystal specification 
requirements, and the names of suitable vendors. 


Table G—1 lists the commonly used crystal frequencies and the devices with 
which they can be used. 


Table G-1. Commonly Used Crystal Frequencies 


G-4 


TMS320C25 40.96 MHz 


TMS320C5x 20.48 MHz | 
eee 
When connected across X1 and X2/CLKIN of the TMS320 processor, a crystal 


enables the internal oscillator. Crystal specification requirements are listed be- 
low. 


Load capacitance = 20 pF 
Series resistance = 30 ohm 
Power dissipation = 1 mW 


Vendors of crystals suitable for use with TMS320 devices are listed below. 


RXD, Inc. 
Norfolk, NB 
(800) 228-8108 


N.E.L. Frequency Controls, Inc. 
Burlington, WI 
(414) 763-3591 


CTS Knight, Inc. 
Contact the local distributor. 


Memories, Sockets and Crystals 
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ROM Codes 


rat etecet es 


The size of a printed circuit board must be considered in many DSP applica- 
tions. To fully utilize the board space, Texas Instruments offers an option that 
reduces the chip count and provides a single-chip solution to its customers. 
On the ’C51, this option incorporates 8K words of on-chip program from a 
mask programmable ROM. This allows you to use a code-customized proces- 
sor for a specific application while taking advantage of the following: 


.) Greater memory expansion 
[_] Lower system cost 

[_}] Less hardware and wiring 
Lj} Smaller PCB 


lf used often, the routine or entire algorithm can be programmed into the on- 
chip ROM of a TMS320 DSP. TMS320 programs can also be expanded by us- 
ing external memory; this reduces chip count and allows for a more flexible 
program memory. Multiple functions are easily implemented by a single de- 
vice, thus enhancing system capabilities. 


TMS320 development tools are used to develop, test, refine, and finalize the 
algorithms. The microprocessor/microcomputer (MP/MC) mode is available 
on all ROM-coded TMS320 DSP devices when accessing either on-chip or off- 
chip memory is required. The microprocessor mode is used to develop, test, 
and refine a system application. In this mode of operation, the TMS320 acts 
as a standard microprocessor by using external program memory. When the 
algorithm has been finalized, the designer may submit the code to Texas In- 
struments for masking into the on-chip program ROM. At that time, the 
TMS320 becomes a microcomputer that executes customized programs from 
the on-chip ROM. Should the code need changing or upgrading, the TMS320 
may once again be used in the microprocessor mode. This shortens the field 
upgrade time and avoids the possibility of inventory obsolescence. 


H-1 


H.1 ROM Code Flow 


Figure H—1 illustrates the procedural flow for developing and ordering 
TMS320 masked parts. When ordering, there is a one-time/nonrefundable 
charge for mask tooling. A minimum production order per year is required for 
any masked-ROM device. ROM codes will be deleted from the Tl system one 
year after the last delivery. 


Figure H-1.TMS320 ROM Code Flowchart 


| Customer TMS320 Design 


Customer Submits: 


— TMS320 New Code Release Form 

— Print Evaluation and Acceptance Form (PEAF) 
— Purchase Order for Mask Charge Prototypes 
— TMS320 Code 


Texas Instruments Responds: 
— Customer Code Input Into TI System 
— Code Sent Back to Customer for Verification 


Customer 
Approves 
Algorithm 


No 


Ti Produces Prototypes 


Customer 
Approves 
Prototypes (Minimum 
Production Order 
Required) 


Yes 


TMS320 Production 


H-2 ROM Codes 


ROM Code Flow 


A TMS320 ROM code may be submitted in one of the following formats (the 
preferred media is 5-1/4-in floppies): 


5-1/4-in Floppy: COFF format from macro-assembler/linker (preferred) 
Modem (BBS): COFF format from macro-assembler/linker 

EPROM (others): §TMS27C64 

PROM: TBP28S166, TBP28S86 


When a code is submitted to Texas Instruments for masking, the code is refor- 
matted to accommodate the Tl mask generation system. System-level verifi- 
cation by the customer is therefore necessary. Although the code has been re- 
formatted, it is important that the changes remain transparent to the user and 
do not affect the execution of the algorithm. The formatting changes involve 
the removal of address relocation information (the code address begins at the 
base address of the ROM in the TMS320 device and progresses without gaps 
to the last address of the ROM on the TMS320 device) and the addition of data 
in the reserved locations of the ROM for device ROM test. Note that because 
these changes have been made, a checksum comparison is not a valid means 
of verification. 


With each masked device order, the customer must sign a disclaimer stating: 


“The units to be shipped against this order were assembled, for expediency 
purposes, on a prototype (that is, nonproduction qualified) manufacturing 
line, the reliability of which is not fully characterized. Therefore, the antici- 
pated inherent reliability of these prototype units cannot be expressly de- 
fined.” 


and a release stating: 


“Any masked ROM device may be resymbolized as TI standard product 
and resold as though it were an unprogrammed version of the device, at 
the convenience of Texas Instruments.” 


The use of the ROM-protect feature does not hold for this release statement. 
Additional risk and charges are involved when the ROM-protect feature is se- 
lected. Contact the nearest TI Field Sales Office for more information on proce- 
dures, leadtimes, and cost associated with the ROM-protect feature. 
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H-4 


ROM Codes 


Appendix | 


Texas Instruments offers an extensive line of development tools for the 'C5x 
generation of DSPs, including tools to evaluate the performance of the proces- 
sors, generate code, develop algorithm implementations, and fully integrate 
and debug software and hardware modules. 


The following products support development of ’C5x-based applications: 


Software Development Tools: 
Assembler/Linker 
Simulator 
Optimizing ANSI C compiler 
Application Algorithms 
C/Assembly Debugger and Code Profiler 


Hardware Development Tools: 
Emulator XDS510 
‘C5x EVM (Evaluation Module) 


Each ’C5x support product is described in the TMS320 Family Development 
Support Reference Guide (literature number SPRUO11). In addition, more 
than 100 TMS320 third-party developers provide support products to comple- 
ment Tl’s offering. For more information on third-party support refer to the 
TMS320 Third Party Reference Guide (literature number SPRU052). 


For information on pricing and availability, contact the nearest Tl Field Sales 
Office or authorized distributor. 


This appendix contains the following: 


Topic Page 


Device and Development Support Tool Nomenclature 


1.1 


Device and Development Support Tool Nomenclature 


To designate the stages in the product development cycle, Texas Instruments 
assigns prefixes to the part numbers of all TMS320 devices and support tools. 
Each TMS320 member has one of three prefixes: TMX, TMP, and TMS. Texas 
Instruments recommends two of three possible prefix designators for its sup- 
port tools: TMDX and TMDS. These prefixes represent evolutionary stages of 
product development from engineering prototypes (TMX/TMDX) through fully 
qualified production devices/ftools (TMS/TMDS). This development flow is de- 
fined below. 


Device Development Evolutionary Flow: 


TMX = Experimental device that is not necessarily representative of the final 
device's electrical specifications. 


TMP __ Final silicon die that conforms to the device's electrical specifications 
but has not completed quality and reliability verification. 


TMS _ Fully qualified production device. 


Support Tool Development Evolutionary Flow: 


TMDX Development support product that has not yet completed Texas 
Instruments internal qualification testing. 


TMDS Fully qualified development support product. 


TMX and TMP devices and TMDX development support tools are shipped 
against the following disclaimer: 


“Developmental product is intended for internal evaluation purposes.” 


TMS devices and TMDS development support tools have been fully character- 
ized, and the quality and reliability of the device has been fully demonstrated. 
Texas Instruments standard warranty applies. 


a IR AR aaa a aR | 
Note: 


Predictions show that prototype devices (TMX or TMP) will have a greater 
failure rate than the standard production devices. Texas Instruments recom- 
mends that these devices not be used in any production system because 
their expected end-use failure rate is still undefined. Only qualified produc- 
tion devices are to be used. 


TI device nomenclature also includes a suffix with the device family name. 
This suffix indicates the package type (for example, N, FN, or GB) and temper- 
ature range (for example, L). Figure I-1 provides a legend for reading the com- 
plete device name for any TMS320 family member. 


Development Support 


Device and Development Support Tool Nomenclature 


Figure I-1. TMS320 Device Nomenclature 


TMS 320 C 25 GB L 


PREFIX L__sewpERaTURE RANGE 
TMX = experimental device H= Oto 50°C 
TMP = prototype device L= Oto 70°C 
TMS = qualified device S = -55 to 100°C 
SMJ = MIL—STD-883C M = -55 to 125°C 
A = -40 to 85°C 
PACKAGE TYPE 
DEVICE FAMILY N = plastic DIP 
320 = TMS320 Family J = ceramic CER-DIP 
JD =ceramic DIP 
side-brazed 


GB =ceramic PGA 
TECHNOLOGY FZ =ceramic CER-QUAD 
C=CMOS FN =plastic leaded CC 
E = CMOS EPROM FD =ceramic leadiess CC 


DEVICE 


Figure I—-2 provides a legend for reading the part number for any TMS320 


hardware or software development tool. 


Figure |-2. TMS320 Development Tool Nomenclature 


TMDS 32 4 281 0-0 2 


QUALIFICATION STATUS 


TMDX = prototype 
TMDS = qualified 


DEVICE FAMILY 
32 = TMS320 family 


PRODUCT TYPE 


4 = software 
6 = hardware 
8 = upgrade 


MODEL? 
11 = XDS/11 
22 = XDS/22 
88 = upgrade kits 


OPERATING SYSTEMt 
02 = C1x VAX/VMS 
08 = C1x IBM MS/PC-DOS 
22 = C2x VAX/VMS 
28 = C2x IBM MS/PC-DOS 
32 = C3x VAX/VMS 
38 = C3x IBM MS/PC-DOS 
42 = C4x VAX/VMS 
48 = C4x IBM MS/PC-DOS 
52 = C5x VAX/VMS 
58 = C5x IBM MS/PC-DOS 


+t Software only. 
+ Hardware only. 


a MEDIUMt 
2 = 5-1/4” floppy disk 
8 = 1600 BPI magnetic tape 


S/W FORMATtT 
0 = object code 
1 = source code 


SEQUENCE NUMBER?* 


GENERATION* 
1 = C1x 
2 = C2x 
3 = C3x 
4=C4x 
5 = C5x 
FORMATt 
1 = Tl-tagged 
5 = COFF 


Development Support 


1.2 Hewlett-Packard E2442A Preprocessor ’C5x Interface 


The Hewlett-Packard E2442A ’C5x preprocessor interface provides a me- 
chanical and electrical connection between your target system and an HP log- 
ic analizer. Preprocessor hardware captures processor signals and passes 
them to the logic analyzer at the appropriate time, depending on the type of 
measurement you are making. With the preprocessor plugged in, both state 
and timing analysis is available. Two connectors are loaded onto the prepro- 
cessor to facilitate communications with other debugging tools. You can use 
a BNC connector, when used with the sequencer of the logic analyzer to halt 
the processor on a condition. Then the 'C5x HLL debugger can be used to ex- 
amine the state of the system (for example, microprocessor registers). Like- 
wise, a 14-pin connector is available to receive signals from the XDS510 de- 
velopment system. These signals can be used when defining a trigger condi- 
tion for the analyzer. 


The HP E2442A includes software which automatically labels address, data 
and status lines. Additionally, a disassembler is included. The disassembler 
processes state traces and displays the information on TMS320 mnemonics. 


1.2.1 °C5x Devices Supported 


1.2.2 Capabilities 


The Hewlett-Packard E2442A preprocessor ’C5x interface supports the ’C50, 
‘C51, and ’C53 devices. 


The preprocessor supports three modes of operation: in the first mode, State 
per Transfer, the preprocessor clocks the logic analizer only when a bus trans- 
fer is complete. In this mode, wait and halt states are filtered out. In the second 
mode, CLKOUT1 clocks the analyzer every time the microprocessor is 
clocked. This mode captures all bus states. An example application would be 
to locate memory locations that do not respond to requests for data. In the third 
mode, you can use the HP E2442A to make timing measurements. 


The JTAG TAP (test access port) controller can be monitored in realtime. TAP 
state can be viewed under the predefined label TAP. 


1.2.3 Logic Analyzers Supported 


HP 1650A/B 

HP 16510B 

HP 16511B 
HP16540/41 (A/D) 
HP16550A 

HP 1660A/61A/62A 


OOUVOO 


Hewlett-Packard E2442A Preprocessor ‘C5x Interface 
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1.2.4 Pods Required 


There are eight pod-connectors on the preprocessor. Three are terminated 
and best used for state analysis as all signals needed for disassembly are 
available. The other five connectors are not terminated and contain all proces- 
sor signals, including asecond set of the signals needed for disassembly. This 
allows you to double probe these signals, making simnultaneous state and 
timing measurements. 


1.2.5 Termination Adapters (TAs) 


Of the eight pods, three are terminated. You may need to order up to five ter- 
mination adapters, depending on how many pods are connected at the same 
time. 


1.2.6 Availability 
For more information and availabilty of the Hewlett-Packard E2442A contact: 


Hewlett-Packard Company 
2000 South Park Place 
Atlanta, GA 30339 
(404) 980-7351 


1-6 Pinout and Signal Descriptions 


ABS instruction, 4-27 
ACCB, 3-5 
See also accumulator 
ACCH, 3-5 
See also accumulator 
ACCL, 3-5 
See also accumulator 
accumulator, 3-2, 3-5, 3-24—3-27, 7-9 
adaptive filtering, 1-4, 7-38 
ADC devices, F-20 
ADCEB instruction, 4-29 
ADD instruction, 4-30 
ADDB instruction, 4-33 
ADDC instruction, 4-34 
addition, 7-20, 7-31 
addition example, 7-21 
address bus, 2-3, 3-4 
address generation, 3-30 
address map, data page 0, 6-14 
address visibility, 3-39, 3-53, 3-56, 6-9 
addressing modes, 3-11 
circular, 7-13, 7-44 
direct, 3-12 
indirect, 3-13 
long immediate, 3-14—3-16 
memory-mapped, 3-12 
register access, 3-14 
registered block memory, 3-16 
short immediate, 3-13 


ADDS instruction, 4-36 
ADDT instruction, 4-38 
ADRK instruction, 4-40 
ADTV, F-18 

ALU instruction steps, 3-22 


Index 


analog interface 
converters, F-4 
peripherals, F-1 
AND instruction, 4-41 
ANDB instruction, 4-43 
APAC instruction, 4-44 
APL instruction, 4-45 
applications, 1-3—1-4, 1-10, 7-36 
ARB, 3-5 
See also auxiliary registers 
architectural overview, 3-2 
architecture, 1-5 
ARCR register, 3-5, 3-19, 6-16 
arithmetic logic unit (ALU), 3-2, 3-5, 3-24 
arithmetic operations, 3-2 
ARP pointer, 3-5 
See also auxiliary registers 
assembly language instructions, 4-1 
auxiliary register arithmetic unit (ARAU), 3-5 
See also auxiliary registers 
auxiliary register file, 3-5, 6-24 
See also auxiliary registers 
auxiliary registers, 3-5, 3-16—3-20, 3-37, 4-4—4-9, 
4-40, 6-15 
circular buffer 1, 3-40, 4-12 
circular buffer 2, 3-40, 4-12 
file, 3-5, 6-24 
pointer, 3-5, 3-39 
pointer buffer, 3-39 


B instruction, 4-48 

BACC[D] instruction, 4-49 
backtracking algorithm, 7-44 
BANZ[D] instruction, 4-50 
BCND example, 7-9 
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BCND instruction, 4-52 

BIG bit, 5-11, 6-32 

BIM, 3-6 

BIO pin, 2-5, 5-14 

BIO timing, 5-14, A-17—A-18 

BIT instruction, 4-54 

bit manipulation, 3-24, 3-51-3-53 

bit-reversed addressing, 4-5, 7-46 

BITT instruction, 4-56 

BLDD example, 6-37 

BLDD instruction, 4-58 

BLDP example, 6-38 

BLDP instruction, 4-64 

block diagram, 3-3 

block moves, 3-20—3-22, 4-20, 6-37—6-39, 
7-15—7-17 

block repeat, 3-5, 3-46—3-48, 6-17, 7-15 
address register, 3-5 

BLPD example, 6-38 

BLPD instruction, 4-67 

BMAR register, 3-5 
See also block moves; block moves 

boot loader, 6-10, 6-40—6-44 

boot ROM, 6-2 

boot routine, 6-40 

BR pin, 2-3, 2-5, 3-6, 6-29, 6-33-6-35 

BRAF bit. See block repeat 

branch execution, 3-32 

branches, 3-32, 4-20 

BRCR register. See block repeat 

BSAR instruction, 4-72 

burst mode (serial mode), 5-20 

burst mode (serial port), 5-23—5-26 

bus protocol, E-3 


C (carry) bit, 3-6, 3-24—3-27, 3-39 

C25 instruction compatibility, D-9 

‘C25 packages, D-2 

‘C25 to ’C5x clocking, D-5 

‘C25 to 'C5x execution times, D-8 

C25 to ’C5x pins/signals, D-4 

‘C25 to 'C5x software compatibility, 3-59, 4-6, 4-257 
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’C25 to TLC32047 interfacing, F-3 
’C2x to 'C5x mapping, 4-257 
’C2x to 'C5x migration, D-1—D-12 
CALAID] instruction, 3-49, 4-73 
CALL[D] instruction, 4-75 
CARx register, 3-40 
See also circular buffer 
CBCR register, 3-38 
See also circular buffer 
CBERx register, 3-6 
See also circular buffer 
CBSRx register, 6-16 
See also circular buffer 
CC[D] instruction, 4-77 
CENBx register. See circular buffer 
central arithmetic logic unit (CALU), 3-6, 3-22-3-29 
central processing unit (CPU), 1-1, 3-1, 3-50 
characteristics of 'C5x processors, 1-6 
circular addressing, 7-12-7-14 
circular buffer, 3-20, 4-12, 6-25, 6-26, 7-12-—7-14 
control register, 6-26 
CLKIN2 pin, 2-7 
CLKMD1 pin, 2-6, A-10—A-13 
CLKMD2 pin, 2-6, A-10—A-13 
CLKOUT1 pin, 2-6 
CLKR pin, 2-8, 5-15 
CLKX pin, 2-8, 5-15 
clock characteristics, A-10 
clock options, A-11 
CLRC instruction, 4-79 
CMPL instruction, 4-81 
CMPR instruction, 4-82 
CNF bit, 3-6, 3-37, 3-40, 6-5, 6-12 
combo interface, F-6 
combo interface timing, F-7 
compatibility, 1-8 
conditional branch, 3-31 
consumer electronics, F-18 
context save/restore, 3-59, 5-7, 7-4 
context switching, 1-8, 7-4 
continuous mode (serial port), 5-27—5-29 
convolution, 1-4, 3-2 
correlation, 3-2 
COUNT register, 3-8 
CPGA package, D-2 


CPL instruction, 4-83 
CRGT example, 7-9 
CRGT instruction, 4-86 
CRLT example, 7-9 

CRLT instruction, 4-87 
crystals, G-4 

CWSR register, 5-12, 6-18 
cycles, C-1 


DAC devices, F-20 
DARAM, 4-24, 6-2, 6-12, C-2 
data bus, 2-3, 3-3, 3-6 
data converters, F-15 
data memory, 3-4, 3-6, 3-51, 6-12 
page pointer, 3-6, 6-19 
data moves. See block moves 
DBMR register, 3-7, 3-51, 6-17 
delayed branches, 3-32, 7-18 
development tool nomenclature, |-4 
device nomenclature, !-3 
digital audio, F-19 
direct addressing mode, 3-12, 4-2—4-4 
divide-by-one clock, 5-48, D-6 
divide-by-one-clock, A-12 
divide-by-two-clock, A-11 
division, 4-232, 7-27 
fractional, 7-30 
integer, 7-29 
division example, 7-29 
DLB bit, 5-18, 5-20, 5-20 
dma (data memory address) register, 3-6—3-8 


DMA (direct memory access), 6-33-—6-36 
address ranges, 6-36 
master/slave configuration, 6-33—-6-36 


DMOV instruction, 3-21, 4-88 

DP register, 3-6, 3-40, 4-2—4-4, 6-19 
DR pin, 2-8, 5-15 

DRB bus, 3-6 

DRR register, 5-16, 6-18 

DS pin, 2-4, 6-27 

dual-access RAM. See DARAM 

DX pin, 2-8, 5-15 


DXR register, 5-16, 6-18 
dynamic programming, 7-42—7-44 


echo cancellation, 1-4 
electrical specifications, A-1 
EMUO pin, 2-9, E-2 
EMU1 pin, 2-10, E-2 
emulator, E-1 

buffered signals, E-9 

cable header, E-2 

cable pod, E-4 

header signals, E-2 

interface, E-5 

timing, E-11 

timings, E-6 

unbuffered signals, E-9 
error conditions 

serial port, 5-29 

TDM serial port, 5-41 
example 

serial port, 5-32 

TDM serial port, 5-41 
EXAR instruction, 4-90 
Ext, 4-24, C-2 
extended-precision arithmetic, 3-25, 7-20 
external crystal, A-10 
external DMA, 6-33 

See also DMA (direct memory access) 


external DMA timing, A-20 
external flag (XF) timing, 5-14, A-18 
external memory interface, 6-11, 6-28, A-14 


fast Fourier transforms (FFT), 1-4, 7-45—7-54 
complex, 7-53 
filtering, 1-3, 3-2 
filters 
adaptive, 7-38 
FIR, 7-39 
IIR, 7-40—-7-42 
switched capacitor, F-4, F-8 
fixed-point generations, 1-2 
floating-point 
addition, 7-31 
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floating-point (continued) 
arithmetic, 7-31 
generations, 1-2 
multiplication, 7-34 

FO bit, 5-18—-5-20 

four-level pipeline, 3-34 

Fourier transforms, 1-4, 7-45—7-54 

fractional division, 7-30 

fractional multiplication, 7-27 

FREE bit, 5-19, 5-23, 5-46 

FSM bit, 5-18, 5-20 

FSR pin, 2-8, 5-15 

FSX pin, 2-8, 5-15 

functional block diagram, 3-3 


global memory, 6-29-6-30 
addressing, 6-30 
configurability, 6-29—6-30 
external interfacing, 6-30 
map, 6-29 
global memory allocation register (GREG), 3-7, 3-7, 
6-17, 6-29-6-31 


hardware multiplier, 3-4, 3-27 
hardware stack, 3-2, 3-9, 3-58 
hardware tools, I-1 

Harvard architecture, 1-5 

HDTV, F-18 

Hewiett-Packard interface, I-5 
HM bit, 3-7, 3-40, 3-50, 6-34 
hold mode, 3-7, 6-34 

HOLD pin, 2-5, 3-54, 6-33-6-36 
HOLDA pin, 2-5, 3-54, 6-33-6-36 


1/0 
boot mode, 6-43 
interfacing, 5-9, 6-31-6-33 
parallel, 1-6, 5-9 
pins, 5-14 
ports, 5-11 
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I/O (continued) 

ports addressing, 6-31 

serial, 1-6, 1-9, 5-15—5-34, 5-35-5-44 

space, 6-31 

port hole, 6-18 

TACK pin, 2-5, 3-54, 5-6, 6-9, 6-10 
TAQ pin, 2-5, 6-33-6-36 
IDLE, 3-50 
IDLE instruction, 4-91, D-9 
IDLE2 instruction, 3-50, 4-92 
IEEE 1149.1, E-3 
IFR register, 3-7, 3-56, 5-2, 5-6, 6-17 
image processing, 1-4 
immediate addressing mode, 3-15, 4-9—4-10 
IMR register, 3-7, 3-57, 5-2, 5-6, 6-17 
IN instruction, 4-93 
INO bit, 5-19 
IN1 bit, 5-19 
indirect addressing mode, 3-13, 3-16—3-20, 4-4—4-9 
indirect addressing routine, 6-21 
INDX register, 3-7, 3-19, 6-16, 6-25 
infinite impulse response (IIR) filters, 7-40 
initialization 

peripherals, 5-8 

processor, 3-53, 7-2 
initialization routine, 7-52 
instruction acquisition (IAQ) timing, A-18 
instruction cycle timings, 4-24, C-1 
instruction descriptions, 4-22 
instruction operands, 3-11 
instruction set, symbols and abbreviations, 4-14 
instruction set summary, 4-16—4-22 
instruction symbols, 4-15 
instrumentation, 1-4 
INT# interrupt, 3-7, 3-55, 5-4—5-8 
INT16, 3-53 
integer division, 7-28—7-30 
integer multiplication, 7-24, 7-26 
interfacing, |/O ports, 5-9 
interfacing memories 

EPROM, 6-11 

global memory, 6-30-6-31 

RAM, 6-11, 6-28 
internal hardware summary, 3-5—3-9 
internal oscillator, A-10 


interprocessor communications, 5-37, 5-42 


interrupt acknowledge signal (IACK), D-6 
See also |ACK pin 
interrupt context save, 3-58 
interrupt latency, 3-57 
interrupt logic, 5-7 
interrupt mode, 3-40 
interrupt timing, A-17—A-18 
interrupt trap, 3-59 
interrupts, 3-53-—3-60, 5-4—5-8, 7-4—7-6 
external, 1-10 
location, 3-55 
operation, 3-54—3-60, 5-4—5-8 
priorities, 3-55, 5-5 
priority, 5-4 
vectors, 3-56, 6-7 
INTM bit, 3-7, 3-40, 3-53, 3-57, 5-7 
INTR example, 7-5 
INTR instruction, 4-95, 7-4 
INTx pin, 2-6, 5-7 
IOWSR register, 6-18 
IPTR pointer, 3-7, 3-40, 3-56 
IREG register, 3-18, 3-30 
IS pin, 2-4, 5-9 


JTAG, E-1 
scanning logic, 1-10 
signals, 2-9, E-3 


key features, 1-7 


LACB instruction, 4-97 
LACC instruction, 4-98 
LACL instruction, 4-101 
LACT instruction, 4-103 
LAMM instruction, 4-105 
LAR instruction, 4-107 


latency 
interrupt, 3-37, 3-57 


pipeline, 3-35 
LDP instruction, 4-110 
LMMR example, 6-39 
LMMR instruction, 4-112 
load circuit, A-8 
local data memory, 6-12 
address map, 6-13 
addressing, 6-19 
direct addressing, 6-20 
external interfacing, 6-27 
indirect addressing, 6-21 
indirect auxiliary register example, 6-23 
long immediate addressing, 6-22 
memory-mapped addressing, 6-20 
registered block memory addressing, 6-22 


logic high, A-9 

logic low, A-9 

logical and arithmetic operations, 7-7—7-11 
long immediate mode, 3-14—3-16, 6-22 
low-power mode, 3-50, 5-48 

LPH instruction, 4-114 

LST instruction, 4-116 

LT instruction, 4-119 

LTA instruction, 4-121 

LTD example, 7-42 

LTD instruction, 4-123 

LTP instruction, 4-125 

LTS instruction, 4-127 


MAC instruction, 3-28, 4-129 
MACD example, 7-40 
MACD instruction, 4-132 
MADD instruction, 4-136 
MADS instruction, 4-140 
MAR instruction, 4-143 
masked parts, H-2 
maximum ratings, A-7 
MCM bit, 5-18 
mechanical data, A-27 
memories, G-2 
memory 

addressing modes, 3-11, 4-2—4-13 

configurability, 6-5 

data, 3-6 
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memory (continued) 
DMA, 6-33-6-36 
external, 6-2 
global, 3-7, 6-29 
internal, 3-10 
local data, 6-12-6-28 
management, 6-37—6-39 
maps, 6-3 
organization, 3-10-3-21 
program memory, 6-5—6-11 
security, 1-9, 6-9 
memory addressing modes, 4-2—4-13 
direct addressing, 3-12 
immediate addressing, 3-15, 4-9—4-10 
indirect addressing, 3-19, 4-4—4-9 
memory interface, 6-10, 6-27, B-3—B-5 
memory space, 6-2 
memory-mapped 
core processor registers, 6-14 
I/O ports, 6-15 
peripheral registers, 6-14—6-16 
register addressing, 4-10—4-11 
registers, 3-10, 5-2 
write, 3-35 
microcall stack (MCS), 3-7 
microcomputer mode, 2-6, 3-7, 3-40, 6-3-6-5 
microprocessor mode, 2-6, 3-7, 3-40, 6-3—6-5 
MMPORT, 4-24, C-2 
MMR, 4-24, C-2 
modem, 1-4, 7-36, F-15 
MP/MC bit and pin, 1-8, 2-6, 3-7, 3-40, 6-3-6-5, 
6-40 
MPY instruction, 4-145 
MPYA example, 7-42 
MPYA instruction, 4-148 
MPYS instruction, 4-150 
MPYU instruction, 3-29, 4-152 
multiconditional branch, 3-31, 7-8 


multimedia applications, F-2 
multimedia-related devices, F-4 
system design, F-2 

multiplexer, 3-7 

multiplication, 7-23 
algorithm, 7-25 
floating point, 7-34 
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fractional, 7-27 

integer, 7-24, 7-26 

matrix, 7-10—7-20 
multiplication example, 7-26 
multiplier, 3-2, 3-7, 3-27 
multiply accumulate, 3-28, 7-38 
multiprocessing, 6-29, 6-33 
multiprocessor configuration, 6-29, 6-33, E-8 
multiprocessor serial communications, 5-35 


NDX bit, 3-7, 3-40, 4-6 

NEG instruction, 4-154 

nested loops, 3-47, 7-11 

NMI instruction, 4-156 

NMI pin, 2-6, 3-59 

nomenclature, I-2 

nonrepeatable instructions, 3-45 

NOP instruction, 4-157 

NORM instruction, 3-37, 4-158, D-8 

not meaningful to repeat instructions, 3-44 


OFF pin, 2-10 

on-chip memory, 1-3, 1-5—1-7, 6-2 
on-chip RAM, 1-9, 6-2, 6-36 
on-chip ROM, 1-8, H-1 

on-chip memory, 1-6 

on-chip ROM, 6-2 

opcode summary, 4-263 
opcode symbols, 4-262 
operand conditions, 3-31 
operating conditions, A-7 

OPL instruction, 4-161 

OR instruction, 4-164 

ORB instruction, 4-167 

OUT instruction, 4-168 

OV bit, 3-8, 3-40 

overflow saturation mode, 3-25 
OVLY bit, 3-8, 3-40, 6-3, 6-12 
OVM bit, 3-8, 3-25, 3-41 


PAB, 3-30 
PAC instruction, 4-170 
packages, 1-6, 1-10, D-2 
packet frequency, 5-15, 5-25 
packing, 7-8 
PAER register, 3-5, 3-46 
parallel boot mode, 6-41 
parallel 1/O ports. See I/O, parallel 
parallel logic unit (PLU), 1-8, 3-2, 3-8, 3-51, 7-7 
parallelism, 3-3, 4-25 
PASR register, 3-6, 3-46 
PAx port, 6-18 

See also I/O, parallel 
PC, 3-8, 3-30, 6-9 
PC environment (DMA), 6-35 
PDA, 4-24, C-2 
PDWSR register, 6-18 
PE, 4-24, C-2 
period register (PRD), 5-46 
peripheral control, 5-2 
peripheral interfacing, D-11 
peripheral reset conditions, 5-8 
PFC bit, 3-8 
pinout, 2-2 
pinouts, A-2—A-6 
pipeline operation, 3-34 
PLCC package, D-2 
PM bits, 3-9, 3-27, 3-41 
PMST, 7-4 
PMST register, 3-9, 3-38, 4-11, 6-5, 6-17 
POP instruction, 4-171 
POPD instruction, 4-172 
postscaling shifter, 3-8 
power-down mode, 3-50 
PR, 4-24, C-2 
PRD register, 5-45, 6-18 
prefetch counter, 3-8 
PREG register, 3-8, 3-27 
preprocessor interface, |-5 
prescaling shifter, 3-8 
priorities, interrupt, 3-55 


processor initialization, 7-2 

product shift mode, 3-41 
See also PM bits 

product shifter, 3-8 

program bus, 3-3 

program counter. See PC 

program execution, 3-30, 6-37 

program memory, 1-9, 6-5 
address bus, 3-8 
address map, 6-7 
configuration control, 6-6 


PS pin, 2-4, 6-10 

PSA, 4-24, C-2 

PSC bits, 5-46 

PSHD instruction, 4-174 
PUSH instruction, 4-176 


quad flat package (QFP), A-27 


R/W pin, 2-4, 6-10, A-15, B-2 
RAM bit, 3-6, 3-41 
RAM blocks, 4-25, 6-2 
RD pin, 2-4, 6-10, 6-27 
READY pin, 2-4, A-17 
ready timing, A-17 
See also READY pin 
receiving multiplexer (serial port), 5-21 
register access mode, 3-14 
registered block mode, 3-16 
registers 
auxiliary, 3-5, 3-16—3-20 
memory-mapped, 3-10, 5-2, 6-13 
peripheral, 5-3 
repeat, 6-16 
serial port, 5-16, 6-18 
software wait states, 5-11, 6-18 
status and control, 3-38 
TDM Serial port, 5-37, 6-18 
timer, 5-45, 6-18 
repeat, 3-42, 3-46 
repeat blocks. See block repeat 
repeat loops, 3-41, 7-15—7-17 
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repeatable instructions, 3-42 
reserved pins, 2-10 
reset condition 

CPU, 3-53 

peripherals, 5-8 
reset timing, A-17—A-18 
RET[D] instruction, 4-177 
RETC instruction, 4-179 
RETE instruction, 4-181 
RETI instruction, 4-182 
right shift, 3-27, 3-28—3-30 
RINT interrupt, 5-17 
RMS routine, 7-18 
robotics, 1-4 
ROL instruction, 4-183 
ROLB instruction, 4-184 
ROM, 4-24, C-2 
ROM codes, 1-8, H-2 
ROR instruction, 4-185 
RORB instruction, 4-186 
RPT example, 7-16, 7-40 
RPT instruction, 4-187 
RPTB instruction, 4-190 
RPTC register, 3-9, 3-42, 5-2, 7-15 
RPTZ instruction, 4-191 
RRDY bit, 5-19 
RRST bit, 5-18, 5-20 
RS pin, 2-6, 3-54, 3-56, 7-2, D-5 
RSR register, 5-16 
RSRFULL bit, 5-19, 5-22 


SACB instruction, 4-192 
SACH instruction, 4-193 
SACL instruction, 4-195 
SAMM instruction, 4-197 
SAR instruction, 4-199 
SARAM, 4-24, 6-2, C-2 
SATH example, 7-31 
SATH instruction, 4-201 
SATL example, 7-31 
SATL instruction, 4-202 
SBB instruction, 4-203 
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SBBB instruction, 4-204 
SBRK instruction, 4-205 
scaling, 3-2, 3-23 
scratch-pad RAM, 6-13, 6-18 
search algorithm, 7-9 
security feature, 6-9 
serial boot mode, 6-43 
serial port, 1-10, 5-15—5-34 
block diagram, 5-17 
control register, 5-18 
error conditions, 5-29 
example, 5-32 
external transmit timing, A-23 
internal transmit timing, A-24 
one-way transfer, 5-16 
operation, 5-15 
pins, 5-15 
receive timing, A-22 
receiving multiplexer, 5-21 
registers, 5-16, 6-18 
reset, 5-20 
timing, A-22 
servo control-related devices, F-13 
servo control/disk drive applications, F-12 
SETC instruction, 4-206 
SFL instruction, 4-207 
SFLB instruction, 4-208 
SFR instruction, 4-209 
SFRB instruction, 4-210 
shadow registers, 3-58, 4-182, 7-4 
shift modes, 3-27 
short immediate mode, 3-13 
sign-extension mode, 3-41 
signal descriptions, 2-1—2-10, A-2—A-6 
single-access RAM. See SARAM 
SMMR example, 6-39 
SMMR instruction, 4-212 
sockets, G-3 
SOFT bit, 5-19, 5-23, 5-46 
software stack, 7-6 
software tools, I-1 
software wait states, 5-10, 6-18, D-7 
SPAC instruction, 4-214 
SPC register, 5-16, 5-18, 6-18 
specifications, 1-7, A-1 
speech encoding, F-3 


speech memories, F-10 

speech synthesis applications, F-10 

SPH instruction, 4-215 

SPL instruction, 4-217 

SPLK instruction, 4-219 

SPM instruction, 4-220 

SQRA instruction, 4-221 

SQRS instruction, 4-223 

SST instruction, 4-225 

STO register, 3-9, 3-38 

ST1 register, 3-9, 3-38 

stack 
hardware, 3-2, 3-9, 3-58 
microcall, 3-7 

status and control registers, 3-38 

status registers, 3-39 

strategic registers, 3-2 

STRB pin, 2-4, 6-10, 6-28, B-2, D-5 

strobe signal (STRB), 6-35 

SUB instruction, 4-227 

SUBB instruction, 4-230 

SUBC example, 7-29-7-31 

SUBC instruction, 4-232 

subroutines, 7-18—7-19 

SUBS instruction, 4-234 

SUBT instruction, 4-236 

subtraction, 7-20 

subtraction example, 7-22 

support tools nomenciature, |-2 

switching characteristics, A-14 

SXM bit, 3-9, 3-23, 3-41 

symbols and abbreviations, instruction set, 
4-14—4-15 

system control, 3-30—3-50 

system migration, D-1—D-12 


T registers. See TREGO, TREG1, or TREG2 
TADD pin, 2-8, 5-36 

target system clock, E-7 

TBLR example, 6-39 


TBLR instruction, 3-21, 4-238 
TBLW example, 6-39 


TBLW instruction, 3-21, 4-241 
TC bit, 3-9, 3-41 
TCK pin, 2-9, E-4 
TCLKR bit, 5-36 
TCLKR pin, 2-8 
TCLKX bit, 5-36 
TCLKX pin, 2-8 
TCR register, 5-45, 5-46, 6-18 
TCSR register, 6-18 
TDDR bits, 5-45, 5-46 
TDI pin, 2-9, E-4 
TDM serial port, 5-35—5-44 
error conditions, 5-41 
example, 5-41 
four-wire bus, 5-37 
Operation, 5-35—5-44 
registers, 5-38—5-44, 6-18 
transmit and receive, 5-39-5-44, A-25-A-27 
TDO pin, 2-9, E-4 
TDR pin, 2-8, 5-36 
TDX pin, 2-8, 5-36 
TDXR register, 6-18 
telecommunications applications, F-5 
telecommunications-related devices, F-7—F-9 
test load circuit, A-8 
test/control flag, 3-41 
TFRM pin, 2-8, 5-36 
TFSR pin, 2-8, 5-36 
TFSX pin, 2-8, 5-36 
TIM register, 5-8, 5-45, 5-47, 6-18 
time division multiplexing port. See TDM port 
time-division multiplexing. See TDM serial port 
timer, 1-10, 5-45—-5-47, D-11 
timer block diagram, 5-45 
timer control register (TCR), 5-46 
See also TCR register 
timer interrupt (TIM). See TIM interrupt 
timer interrupt (TINT). See TINT interrupt 
timer registers, 6-18 
timing, D-7 
combo interface, F-7 
emulator, E-11 
external interface, B-1 
requirements, A-14, A-23 
TINT interrupt, 5-4, 5-45 
TINT rate, 5-45 
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TLC32071, F-14 

TMS pin, 2-9, E-4 
TMS320 family, 1-2 
TOUT pin, 2-7, 5-45 
TOUT timing, A-18 

TRAD register, 6-18 
TRAP instruction, 4-244 
TRB bit, 5-45, 5-46 
TRCV register, 6-18 
TREGO, 6-17 

TREGO register, 3-9, 3-27 
TREG1, 6-17 

TREG1 register, 3-7, 3-24 
TREG2, 6-17 

TREG2 register, 3-7, 4-56 
TRM bit, 3-9, 3-41 

TRNT interrupt, 5-4 
TRST pin, 2-9, E-2 

TRTA register, 6-18 
TSPC register, 6-18 

TSS bit, 5-46 

TSS interrupt, 5-45 


TTL-level 
inputs, A-9 
outputs, A-9 
TXM bit, 5-18 
TXNT interrupt, 5-4 


unpacking, 7-7 
user-maskable interrupts. See IMR register 


V.32 encoder, 7-36 
VDD pin, 2-7 
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vectors 
interrupt, 5-5 
See also interrupts 
reset. See RS pin 


video signal processing, F-19 
voice synthesizers, F-10 
VSS pin, 2-7 


wait states, 6-32, A-16—A-18 
registers, 6-18 

wait-state generator, 1-9, 5-10, 5-13 

warm boot mode, 6-44 

WE pin, 2-4, 6-10, 6-27, B-2 

word moves, 3-21 


X1 pin, 2-6 

X2/CLKIN1 pin, 2-6 

XC example, 7-18 

XC execution, 3-33 

XC instruction, 4-245 
XDS510 emulator, E-1 
XF bit, 2-5, 3-7, 3-41 
XINT interrupt, 5-17 
XOR instruction, 4-247 
XORB instruction, 4-249 
XPL instruction, 4-250 
XRDY, 5-19 

XRST bit, 5-18, 5-20 
XSR register, 5-16 
XSREMPTY bit, 5-19, 5-21 


ZALR instruction, 4-253 
ZAP instruction, 4-255 
ZPR instruction, 4-256 
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